Files in this item

FilesDescriptionFormat

application/pdf

application/pdfStahlman_Sheffield_Poster.pdf (154kB)
(no description provided)PDF

Description

Title:Geoparsing biodiversity heritage library collections: A preliminary exploration
Author(s):Stahlman, Gretchen Renee; Sheffield, Carolyn
Subject(s):Biodiversity Heritage Library
Geoparsing
Biodiversity
Text mining
Data
Abstract:A short pilot study was conducted to provide recommendations on methods and workflows for extracting geographic references from the text of Biodiversity Heritage Library collections and disambiguating these references. An initial survey of the literature was conducted, and a variety of possible techniques and software were subsequently explored for natural language processing, machine learning, document annotation, and map visualization. A test corpus was evaluated, and preliminary findings identify challenges for a full-scale effort towards automated geoparsing, including: varying OCR quality, diversity of the corpus, historical context, and ambiguity of geographic references. The project background, approaches, and preliminary assessment are described here.
Issue Date:2019-03-15
Publisher:iSchools
Series/Report:iConference 2019 Proceedings
Genre:Conference Poster
Type:Text
Language:English
URI:http://hdl.handle.net/2142/103357
DOI:https://doi.org/10.21900/iconf.2019.103357
Rights Information:Copyright 2019 Gretchen Renee Stahlman and Carolyn Sheffield
Date Available in IDEALS:2019-03-22


This item appears in the following Collection(s)

Item Statistics