Files
pdf_signature_extraction/signature_analysis
gbanyan 453f1d8768 Phase 3 close-out: Script 42 + §IV draft v2 (Table XV filled)
Script 42 tabulates the §III-L five-way per-signature classifier
output on the Big-4 sub-corpus (n=150,442 signatures classified)
and aggregates to document-level (n=75,233 unique PDFs) under
the worst-case rule.

Per-signature five-way overall (Table XV):

  HC  74,593  49.58%  high-confidence non-hand-signed
  MC  39,817  26.47%  moderate-confidence non-hand-signed
  HSC    314   0.21%  high style consistency
  UN  35,480  23.58%  uncertain
  LH     238   0.16%  likely hand-signed

Per-firm five-way (% within firm):

  Firm A (Deloitte)  HC 81.70%, MC 10.76%, UN 7.42%
  Firm B (KPMG)      HC 34.56%, MC 35.88%, UN 29.09%
  Firm C (PwC)       HC 23.75%, MC 41.44%, UN 34.21%
  Firm D (EY)        HC 24.51%, MC 29.33%, UN 45.65%

Document-level (Table XV-B, NEW):

  HC  46,857  62.28%
  MC  19,667  26.14%
  HSC    167   0.22%
  UN   8,524  11.33%
  LH      18   0.02%
  Total 75,233 unique Big-4 PDFs (single-firm 74,854; mixed-firm 379)

§IV v2 changes vs v1:
  - Table XV populated with Script 42 counts
  - Table XV-B (NEW): document-level worst-case counts
  - Per-firm five-way breakdown (% within firm) added
  - Per-firm document-level breakdown added
  - Document-level paragraph in §IV-J updated to reference Table XV-B
  - Phase 3 close-out checklist: item 1 (Table XV TBD) and item 4
    (document-level counts) marked RESOLVED; remaining items reduced
    from 5 to 3 (renumbering, content audit, codex open-questions)

The per-firm pattern is consistent with the §III-K Spearman-and-
cluster ordering: Firm A's signatures concentrate in HC (81.7%),
the three non-Firm-A firms have markedly lower HC and substantially
higher Uncertain rates (29-46%), with Firm D having the highest
Uncertain rate of the Big-4 -- consistent with the reverse-anchor
score (§III-K Score 2) ranking Firm D fractionally above Firm C in
the hand-leaning direction.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-12 16:45:22 +08:00
..