Files in this item

FilesDescriptionFormat

application/pdf

application/pdfPrateek_Jindal.pdf (665kB)
(no description provided)PDF

Description

Title:Information extraction for clinical narratives
Author(s):Jindal, Prateek
Director of Research:Roth, Dan
Doctoral Committee Chair(s):Roth, Dan
Doctoral Committee Member(s):Gunter, Carl A.; Zhai, ChengXiang; Chapman, Wendy
Department / Program:Computer Science
Discipline:Computer Science
Degree Granting Institution:University of Illinois at Urbana-Champaign
Degree:Ph.D.
Genre:Dissertation
Subject(s):Natural Language Processing
Electronic Health Records
Mention Detection
Coreference Resolution
Drug Abuse Events
Set Expansion
Integer Linear Programming
Integer Quadratic Programming
Information Extraction
Text Mining
Clinical Narratives
Temporal Expression Extraction
Joint Inference
Abstract:Recent US government initiatives have made available a large number of Electronic Health Records (EHRs). These EHRs contain valuable information which can be used in Clinical Decision Support (CDS). So, Information Extraction (IE) from EHRs is a very promising research area. In this thesis, I focus on two tasks namely Mention Detection and Coreference Resolution. A lot of domain knowledge is available regarding clinical narratives. There are also several tools like SpecialistLexicalTools, MetaMap, etc. which help in analyzing clinical narratives. I integrate the domain knowledge and features derived from these tools in the local statistical models. Clinical narratives have a very special format which gives several interconnections between the tasks of mention detection and coreference resolution. A joint formulation for these two tasks has been presented in this thesis. Along with this, there is also a discussion regarding joint formulation for finding the mention types together. Soft constraints have been used while formulating the inference tasks. Softening the constraints is helpful because it allows the constraints to be violated during inference. Joint formulation is based on the fact that only local models are learned in the training phase. Inconsistencies in the decisions based on local models are resolved during the global inference step. I report the best results, to date, on end-to-end coreference resolution. The joint formulation presented in this thesis is very general and would benefit other information extraction tasks as well. I have made the systems described in this thesis publicly available for research use.
Issue Date:2014-01-16
URI:http://hdl.handle.net/2142/46618
Rights Information:Copyright 2013 Prateek Jindal
Date Available in IDEALS:2014-01-16
Date Deposited:2013-12


This item appears in the following Collection(s)

Item Statistics