Withdraw
Loading…
Interactive Learning Protocols for Natural Language Applications
Small, Kevin M.
Loading…
Permalink
https://hdl.handle.net/2142/13854
Description
- Title
- Interactive Learning Protocols for Natural Language Applications
- Author(s)
- Small, Kevin M.
- Issue Date
- 2009-10-02
- Doctoral Committee Chair(s)
- Roth, Dan
- Committee Member(s)
- DeJong, Gerald F.
- Hockenmaier, Julia
- McCallum, Andrew
- Department of Study
- Computer Science
- Discipline
- Computer Science
- Degree Granting Institution
- University of Illinois at Urbana-Champaign
- Degree Name
- Ph.D.
- Dissertation
- Keyword(s)
- machine learning
- active learning
- interactive learning
- natural language processing
- Language
- en
- Abstract
- Statistical machine learning has become an integral technology for solving many informatics applications. In particular, corpus-based statistical techniques have emerged as the dominant paradigm for core natural language processing (NLP) tasks such as parsing, machine translation, and information extraction, amongst others. However, while supervised machine learning is well understood, its successful application to practical scenarios is predicated on obtaining large annotated corpora and performing significant feature engineering, both notably expensive undertakings. Interactive learning protocols offer one promising solution for reducing these costs by allowing the learner and domain expert to interact during learning in an effort to both reduce sample complexity and improve system performance. By specifying a method where the learner may request targeted information, the domain expert is focused on providing the most useful information. This work formalizes a general framework for interactive learning and examines two interactive learning protocols with particular attention to natural language scenarios. We first examine active learning for structured output spaces, the scenario where there are multiple predictions which must be composed into a structurally coherent global prediction. Secondly, we examine active learning for pipeline models, where a complex prediction is decomposed into a sequence of predictions where each stage explicitly relies on the output of previous stages. These two widely-used models are particularly applicable for complex application scenarios where obtaining labeled data is particularly expensive. By allowing the learner to select which examples to label, we demonstrate significant reductions in sample complexity for both semantic role labeling and an entity/relation extraction task. Secondly, we introduce the interactive feature space construction protocol, which uses a more sophisticated interaction to incrementally add application-targeted domain knowledge to the feature space. Whereas active learning restricts the interaction to additional labeled data, the interactive feature space construction protocol better utilizes the domain expert by focusing direct modification of the feature space to improve performance and reduce sample complexity. Through this protocol, we demonstrate further improvements on our entity/relation extraction system.
- Type of Resource
- text
- Permalink
- http://hdl.handle.net/2142/13854
- Copyright and License Information
- Copyright 2009 Kevin Small
Owning Collections
Dissertations and Theses - Computer Science PRIMARY
Dissertations and Theses from the Dept. of Computer ScienceManage Files
Loading…
Edit Collection Membership
Loading…
Edit Metadata
Loading…
Edit Properties
Loading…
Embargoes
Loading…