Files in this item

File: GodbyEtAl.pdf (638kB)
Description: Main article
Format: PDF (application/pdf)

Description

Title: Who's Who in Your Digital Collection: Developing a Tool for Name Disambiguation and Identity Resolution
Author(s): Godby, Carol Jean; Hswe, Patricia; Jackson, Larry S.; Klavans, Judith; Ratinov, Lev; Roth, Dan; Cho, Hyoungtae
Subject(s): named entity recognition; entity resolution; metadata generation; name disambiguation
Abstract: In the past twenty years, the problem space of automatically recognizing, extracting, classifying, and disambiguating named entities (e.g., the names of people, places, and organizations) from digitized text has received considerable attention in research produced by the library, computer science, and computational linguistics communities. However, linking the output of these advances with the library community continues to be a challenge. This paper describes work being done by the University of Illinois, the Online Computer Library Center (OCLC), and the University of Maryland to develop, evaluate, and link Named Entity Recognition (NER) and Entity Resolution with tools used for search and access. Name identification and extraction tools, particularly when integrated with a resolution into an authority file (e.g., WorldCat Identities, Wikipedia, etc.), can enhance reliable subject access for a document collection, improving document discoverability by end-users.
Issue Date: 2009
Publisher: 2009 Chicago Colloquium on Digital Humanities and Computer Science
Citation Info: Godby, C.J., Hswe, P., Jackson, L., Klavans, J., Ratinov, L., Roth, D., Cho, H. (2009). Who’s Who in Your Digital Collection: Developing a Tool for Name Disambiguation and Identity Resolution. In Proceedings of the 2009 Chicago Colloquium on Digital Humanities and Computer Science. Publication pending.
Genre: Conference Paper / Presentation
Type: Text
Language: English
URI: http://hdl.handle.net/2142/15393
Publication Status: published or submitted for publication
Peer Reviewed: is peer reviewed
Sponsor: Library of Congress / NDIIPP-2 A6075
Date Available in IDEALS: 2010-04-13

