This is a scanned copy, not the original PDF or a copy of it. You can make it searchable with OCR, but it's not the same as the other document we saw, which is like the original Word document converted into PDF.
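OCR (optical character recognition) is text recognition… used on a scanned document.
The copy/paste trick for revealing redactions will not work on a scanned document, even after OCR: the scan only captures an image of the page, black bars included, so there is no hidden text under the redaction for OCR to recover.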
Redaction searchers: If the PDF is not already searchable (i.e. you cannot copy and paste the text BEFORE running OCR), then you're not going to be able to reveal the redaction via copy/paste.
Most of the FOIA/Vault/etc. docs we have access to appear to be scans, so the copy/paste does not work. Scanned documents are easy to tell apart from the kind we want at first glance: you can see the imperfections from the scanning, and you cannot copy/paste even the unredacted words without OCR.
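If you'd rather triage a pile of files than eyeball each one, here is a rough sketch of the scan-vs-searchable check described above. It is only a crude byte-level heuristic, not a real PDF parser: the function name `looks_like_scan` and the two markers it searches for are my own assumptions, and it can be fooled by PDFs that store their structure in compressed object streams. It will also report a scan that has already been OCR'd as "not a scan", since OCR adds a text layer, so run it on the file as downloaded.

```python
import re

# Assumption: a born-digital PDF usually declares at least one font resource,
# while a raw scan is typically just page-sized images with no fonts at all.
FONT_MARKER = re.compile(rb"/Font\b")
IMAGE_MARKER = re.compile(rb"/Subtype\s*/Image\b")


def looks_like_scan(pdf_bytes: bytes) -> bool:
    """Guess whether a PDF is an un-OCR'd scan (images present, no fonts)."""
    has_font = FONT_MARKER.search(pdf_bytes) is not None
    has_image = IMAGE_MARKER.search(pdf_bytes) is not None
    return has_image and not has_font
```

To use it, read the file in binary mode, e.g. `looks_like_scan(open(path, "rb").read())`, where `path` is whatever file you are checking. A `True` result suggests the copy/paste approach is not worth trying on that file; a `False` result only means the file is worth a manual look.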
me too. seems so simple.
but I never fully understood what Q meant by "you have more than you know."