Files in this item



application/pdfGAO-DISSERTATION-2019.pdf (5MB)
(no description provided)PDF


Title:Extracting and utilizing hidden structures in large datasets
Author(s):Gao, Yihan
Director of Research:Parameswaran, Aditya
Doctoral Committee Chair(s):Parameswaran, Aditya
Doctoral Committee Member(s):Chang, Kevin; Sundaram, Hari; Wang, Jiannan
Department / Program:Computer Science
Discipline:Computer Science
Degree Granting Institution:University of Illinois at Urbana-Champaign
Subject(s):Structure Extraction
Automatic Data Processing
Abstract:The hidden structure within datasets --- capturing the inherent structure within the data not explicitly captured or encoded in the data format --- can often be automatically extracted and used to improve various data processing applications. Utilizing such hidden structure enables us to potentially surpass traditional algorithms that do not take this structure into account. In this thesis, we propose a general framework for algorithms that automatically extract and employ hidden structures to improve data processing performance, and discuss a set of design principles for developing such algorithms. We provide three examples to demonstrate the power of this framework in practice, showcasing how we can use hidden structures to either outperform state-of-the-art methods, or enable new applications that are previously impossible. We believe that this framework can offer new opportunities for the design of algorithms that surpass the current limit, and empower new applications in database research and many other data-centric disciplines.
Issue Date:2019-03-26
Rights Information:Copyright 2019 Yihan Gao
Date Available in IDEALS:2019-08-23
Date Deposited:2019-05

This item appears in the following Collection(s)

Item Statistics