Interactive Learning Protocols for Natural Language Applications

Small, Kevin M.

Interactive Learning Protocols for Natural Language Applications

Small, Kevin M.

Permalink

https://hdl.handle.net/2142/13854

Description

Title

Interactive Learning Protocols for Natural Language Applications

Author(s)

Small, Kevin M.

Issue Date

2009-10-02

Doctoral Committee Chair(s)

Roth, Dan

Committee Member(s)

DeJong, Gerald F.
Hockenmaier, Julia
McCallum, Andrew

Department of Study

Computer Science

Discipline

Computer Science

Degree Granting Institution

University of Illinois at Urbana-Champaign

Degree Name

Ph.D.
Dissertation

Date of Ingest

2009-10-02T17:40:43Z

Keyword(s)

machine learning
active learning
interactive learning
natural language processing

Language

Abstract

Statistical machine learning has become an integral technology for solving many informatics applications. In particular, corpus-based statistical techniques have emerged as the dominant paradigm for core natural language processing (NLP) tasks such as parsing, machine translation, and information extraction, amongst others. However, while supervised machine learning is well understood, its successful application to practical scenarios is predicated on obtaining large annotated corpora and performing significant feature engineering, both notably expensive undertakings. Interactive learning protocols offer one promising solution for reducing these costs by allowing the learner and domain expert to interact during learning in an effort to both reduce sample complexity and improve system performance. By specifying a method where the learner may request targeted information, the domain expert is focused on providing the most useful information. This work formalizes a general framework for interactive learning and examines two interactive learning protocols with particular attention to natural language scenarios. We first examine active learning for structured output spaces, the scenario where there are multiple predictions which must be composed into a structurally coherent global prediction. Secondly, we examine active learning for pipeline models, where a complex prediction is decomposed into a sequence of predictions where each stage explicitly relies on the output of previous stages. These two widely-used models are particularly applicable for complex application scenarios where obtaining labeled data is particularly expensive. By allowing the learner to select which examples to label, we demonstrate significant reductions in sample complexity for both semantic role labeling and an entity/relation extraction task. Secondly, we introduce the interactive feature space construction protocol, which uses a more sophisticated interaction to incrementally add application-targeted domain knowledge to the feature space. Whereas active learning restricts the interaction to additional labeled data, the interactive feature space construction protocol better utilizes the domain expert by focusing direct modification of the feature space to improve performance and reduce sample complexity. Through this protocol, we demonstrate further improvements on our entity/relation extraction system.

Type of Resource

text

Permalink

http://hdl.handle.net/2142/13854

Copyright and License Information

Owning Collections

Dissertations and Theses - Computer Science PRIMARY

Dissertations and Theses from the Siebel School of Computer Science

Interactive Learning Protocols for Natural Language Applications

Small, Kevin M.

Permalink

Description

Owning Collections

Dissertations and Theses - Computer Science PRIMARY

Log In