295,224 documents (22.6% of the corpus) show suppression pattern indicators (D8), but only 811 (0.06%) show explicit redaction patterns (V10). This 364:1 ratio is the single most significant statisti…
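The percentages and the ratio quoted here can be rechecked from the raw counts. The following is a minimal sketch, assuming the corpus size is the 1,306,136 classified documents reported elsewhere in this analysis; the variable and function names are illustrative, not part of any existing pipeline.

```python
# Counts as reported in the text; the names are hypothetical.
CORPUS_SIZE = 1_306_136   # classified documents
D8_DOCS = 295_224         # documents with suppression pattern indicators (D8)
V10_DOCS = 811            # documents with explicit redaction patterns (V10)


def share(count: int, total: int) -> float:
    """Return count as a percentage of total."""
    return 100.0 * count / total


d8_share = share(D8_DOCS, CORPUS_SIZE)
v10_share = share(V10_DOCS, CORPUS_SIZE)
ratio = D8_DOCS / V10_DOCS

print(f"D8 share:  {d8_share:.1f}%")    # 22.6%
print(f"V10 share: {v10_share:.2f}%")   # 0.06%
print(f"D8:V10 ratio: {ratio:.0f}:1")   # 364:1
```

Both shares use the same denominator, so the ratio depends only on the two document counts and not on the corpus size.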
Across 1,306,136 classified documents, Detection-phase questions (D1-D8) generated 491,140 tag hits while Verification-phase questions (V9-V16) generated only 320,928. This 1.5:1 ratio means the arch…
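As a quick check on the phase comparison, the sketch below recomputes the hit ratio and adds a per-question average. It assumes each phase covers eight questions, as the ranges D1-D8 and V9-V16 suggest; the per-question averages are derived here and are not figures quoted in the text.

```python
# Tag-hit totals as reported in the text; the names are hypothetical.
DETECTION_HITS = 491_140      # D1-D8
VERIFICATION_HITS = 320_928   # V9-V16
QUESTIONS_PER_PHASE = 8       # assumption: eight questions in each range

hit_ratio = DETECTION_HITS / VERIFICATION_HITS
mean_detection = DETECTION_HITS / QUESTIONS_PER_PHASE
mean_verification = VERIFICATION_HITS / QUESTIONS_PER_PHASE

print(f"Detection:Verification = {hit_ratio:.2f}:1")
print(f"Mean hits per Detection question:    {mean_detection:,.1f}")
print(f"Mean hits per Verification question: {mean_verification:,.1f}")
```

Because both phases are assumed to have the same number of questions, the per-question averages stand in the same ratio as the totals.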
Ten of the 24 Questions appear in less than 1% of the corpus. These rare signals are disproportionately valuable: each document tagged with them is a needle in a 1.3-million-document haystack. The ra…
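One way to single out such rare signals is to filter per-question document counts against the 1% threshold. This is a minimal sketch under stated assumptions: the `rare_tags` helper and the `doc_counts` mapping are hypothetical, and the mapping holds only the two counts quoted in this analysis (D8 and V10), not the full set of 24.

```python
CORPUS_SIZE = 1_306_136   # classified documents, as reported in the text
RARE_THRESHOLD = 0.01     # "less than 1% of the corpus"


def rare_tags(doc_counts: dict[str, int], corpus_size: int,
              threshold: float = RARE_THRESHOLD) -> list[str]:
    """Return tags found in fewer than `threshold` of documents, rarest first."""
    rare = [tag for tag, n in doc_counts.items() if n / corpus_size < threshold]
    return sorted(rare, key=lambda tag: doc_counts[tag])


# Only the counts quoted in the text; a full run would include all 24 Questions.
doc_counts = {"D8": 295_224, "V10": 811}

print(rare_tags(doc_counts, CORPUS_SIZE))
```

Sorting by ascending count puts the rarest tags, and so the smallest review sets, first.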
This analysis examines 30 documents tagged with V10, focusing on redaction patterns. Key observations include frequent redactions related to personal identifiers, sensitive financial information, and…