Files in this item



2pt11_Ikeda-OCR.pdf (application/pdf, 838kB)


Title: Human-assisted OCR of Japanese books with different kinds of microtasks
Author(s): Ikeda, Kosetsu; Hayashi, Ryota; Nagasaki, Kiyonori; Morishima, Atsuyuki
Subject(s): Digital transcription
Abstract: Human-assisted OCR is a common approach for transcribing books and has been used in many digital library projects. This paper reports on our project to transcribe the book collections of the National Diet Library using this approach. Our project is unique in two ways. First, we try to extend the human-assisted OCR approach by distributing microtasks in many ways other than just showing tasks on a specific Web page on PC screens. Second, we deal with Japanese books, which contain thousands of characters, some of which look similar to each other. This paper shows that we can expect high-quality results even when we transcribe Japanese texts with microtasks, and that we can expect the number of performed microtasks to be stable if we distribute microtasks to equipment with which workers perform microtasks in their daily lives.
Issue Date: 2017
Citation Info: Ikeda, K., Hayashi, R., Nagasaki, K. & Morishima, A. (2017). Human-assisted OCR of Japanese Books with Different kinds of Microtasks. In iConference 2017 Proceedings, Vol. 2 (pp. 113-117).
Series/Report: iConference 2017 Proceedings Vol. 2
Genre: Conference Paper / Presentation
Rights Information: Copyright 2017 is held by the authors.
Date Available in IDEALS: 2017-12-05
