Files in this item

File:2pt11_Ikeda-OCR.pdf (838kB)
Description:(no description provided)
Format:PDF (application/pdf)

Description

Title:Human-assisted OCR of Japanese books with different kinds of microtasks
Author(s):Ikeda, Kosetsu; Hayashi, Ryota; Nagasaki, Kiyonori; Morishima, Atsuyuki
Subject(s):Digital transcription; Crowdsourcing; Microtasks
Abstract:Human-assisted OCR is a common approach for transcribing books and has been used for many digital library projects. This paper reports our project for transcribing the book collections of the National Diet Library using this approach. Our project is unique in two ways. First, we try to extend the human-assisted OCR approach by distributing microtasks in many ways other than just showing tasks on a specific Web page on PC screens. Second, we deal with Japanese books, which have thousands of characters, some of which look similar to each other. This paper shows that we can expect high-quality results even if we transcribe Japanese texts with microtasks, and that we can expect the number of performed microtasks to be stable if we distribute microtasks to equipment with which workers perform microtasks in their daily lives.
Issue Date:2017
Publisher:iSchools
Citation Info:Ikeda, K., Hayashi, R., Nagasaki, K. & Morishima, A. (2017). Human-assisted OCR of Japanese Books with Different kinds of Microtasks. In iConference 2017 Proceedings, Vol. 2 (pp. 113-117). https://doi.org/10.9776/17222
Series/Report:iConference 2017 Proceedings Vol. 2
Genre:Conference Paper / Presentation
Type:Text
Language:English
URI:http://hdl.handle.net/2142/98864
DOI:https://doi.org/10.9776/17222
Rights Information:Copyright 2017 is held by the authors.
Date Available in IDEALS:2017-12-05

