Taxonomic Name Recognition in Biodiversity Heritage Library
Wei, Qin; Freeland, Chris; Heidorn, P. Bryan
Loading…
Permalink
https://hdl.handle.net/2142/9140
Description
Title
Taxonomic Name Recognition in Biodiversity Heritage Library
Author(s)
Wei, Qin
Freeland, Chris
Heidorn, P. Bryan
Issue Date
2008-10-17
Keyword(s)
Taxonomic Name Recognition
Biodiversity Data
Abstract
Taxonomic Name Recognition (TNR) algorithm – identifying a text string as a taxonomic name or not and recognizing the boundaries of the name – is very important in BHL digitization project in determining whether the users/researchers could find the materials they want efficiently. The BHL has incorporated TaxonFinder, a taxonomic name finding algorithm and service provided by uBio.org, into its portal for the identification and verification of taxonomic name strings found within the digitized BHL corpus. An eight-week evaluation was performed to determine the factors affecting the accuracy of the results returned. Our findings are not only valuable for BHL but also for other digital projects that would like to do text mining on their collections. In this evaluation project, we explored and analyzed the factors influencing the performance of: 1) Optical Character Recognition (OCR) for transforming images into text, 2) TNR matching algorithms for identifying taxonomic names from texts, and 3) the completeness of NameBank, which is used as an authority file for name verification.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.