I'm finally up to speed on the raw Q data file cleanout. What a job!!!! But I have 259 postings from 4Chan (the totality of them); all are double-verified for date, time, content, message number, and attached images. ALL referenced messages are present as well (one level deep of nesting). I can no longer use the 4Chan feature so I'm hoping the remainder can be accessed via the Github page and/or the spreadsheet. NOTHING in this file is verified except off the original postings. To avoid the "copy of a copy" degradation problem.
I've cleaned up many errors, including wrong text for the message number. This round has 100% confidence in content.
8Chan messages remain. It should go much faster since I can use the links in the spreadsheet or the GitHub page.
When it's all done, it'll be publicly available in (hopefully) several different database file formats so that anybody who wants to work with the data via software development will have that data immediately available.