A Prototype Gutenberg-Hathitrust Sentence-Level Parallel Corpus for OCR Error Analysis: Pilot Investigations
Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries(2022)
Key words
sentence-level parallel corpus,optical character recognition,error analysis,digital libraries,digital humanities,data curation
AI Read Science
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined