Anonymous ID: 46eec3 Dec. 10, 2019, 2:56 p.m. No.7475522   >>5550 >>5603 >>5608 >>5746 >>5941

OCR Errors

a few links, maybe useful for some of you diggers

http://www.infogridpacific.com/blog/igp-blog-20130317-ocr-production-nightmares.html

https://wiki.epfl.ch/bigdata2015-linguistic-drift-le-temps/ocr-correction

https://arxiv.org/pdf/1204.0191.pdf

Interesting:

https://en.wikipedia.org/wiki/Optical_character_recognition

Commissioned by the U.S. Department of Energy (DOE), the Information Science Research Institute (ISRI) had the mission to foster the improvement of automated technologies for understanding machine printed documents, and it conducted the most authoritative of the Annual Test of OCR Accuracy from 1992 to 1996.[31]