University of Illinois Urbana-Champaign

The Gutenberg-HathiTrust Parallel Corpus: A Real-World Dataset for Noise Investigation in Uncorrected OCR Texts

Jiang, Ming; Hu, Yuerong; Worthey, Glen; Dubnicek, Ryan C.; Capitanu, Boris; Kudeki, Deren; Downie, J. Stephen

Content Files
iConference21_poster580.pdf
Jiang-The Gutenberg-HathiTrust Parallel Corpus-580.zip
