Towards robust clinical predictive modeling with heterogeneous electronic health record data
Wu, Zhenbang
This item is only available for download by members of the University of Illinois community. Students, faculty, and staff at the U of I may log in with your NetID and password to view the item. If you are trying to access an Illinois-restricted dissertation or thesis, you can request a copy through your library's Inter-Library Loan office or purchase a copy directly from ProQuest.
Permalink
https://hdl.handle.net/2142/125712
Description
Title
Towards robust clinical predictive modeling with heterogeneous electronic health record data
Author(s)
Wu, Zhenbang
Issue Date
2024-07-15
Director of Research (if dissertation) or Advisor (if thesis)
Sun, Jimeng
Department of Study
Siebel Computing &DataScience
Discipline
Computer Science
Degree Granting Institution
University of Illinois at Urbana-Champaign
Degree Name
M.S.
Degree Level
Thesis
Keyword(s)
deep learning for healthcare
clinical predictive modeling
electronic health records
Abstract
With the widespread adoption of Electronic Health Record (EHR) systems, there has been increasing interest in leveraging deep learning for clinical predictive modeling. However, existing models typically assume a uniform feature and label space. In contrast, different hospitals often use varied EHR systems with unique schemas (i.e., feature space). Additionally, the clinical tasks (i.e., label space) can change dynamically. In this work, we present two methods to address discrepancies in feature and label spaces across different healthcare settings. AutoMap is designed to enable the deployment of clinical predictive models across hospitals with diverse medical coding systems. It automatically aligns medical codes across different EHR systems via ontology-level alignment and code-level refinement. EDGE is designed to recommend newly developed drugs, which often lack extensive historical prescription data. By formulating new drug recommendation as a few-shot learning problem, it employs a drug-dependent multi-phenotype few-shot learner to quickly adapt to new drugs. We validate both methods using real-world EHR datasets from MIMIC-III, MIMIC-IV, eICU, and Claims databases. Our results demonstrate their effectiveness in addressing the challenges posed by unmatched feature and label spaces in clinical predictive modeling.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.