Files in this item



application/pdfJIANG-THESIS-2015.pdf (298kB)
(no description provided)PDF


Title:Structured entity querying over unstructured text
Author(s):Jiang, Jiahui
Advisor(s):Chang, Kevin Chen-chuan
Department / Program:Computer Science
Discipline:Computer Science
Degree Granting Institution:University of Illinois at Urbana-Champaign
Subject(s):structured query
unstructured dataset
Abstract:The web is a collection of unstructured webpages. This characteristic makes it very hard for users to search complex queries -- in most of the times, queries are just plain text and it is very difficult for users to describe the internal relationships that they contain. The EntityRank system allows users to specify the target entities for which they are interested within the query, which brings us one step closer to being able to describe the entity relationships. But this expressiveness is still limited because the queries are still in a flat format. On the other hand, SQL queries for relational databases are very expressive. Users can easily specify multiple entities and the relationships between them. But in order to use SQL queries, we need to extract all the entities and relationships existing in the domain beforehand. The cost of maintaining the tables is very large. Also it is very hard if we want to add or modify the schema after the initial domain design. Thus we want to build a system that can combine the advantages of the flexibility and informativeness of unstructured webpages and the expressiveness of SQL queries. In this work we design a system which allows users to use structured SQL queries on unstructured webpages using the help of the EntityRank system. We design the conceptual framework to map the concepts between two systems and also propose a ranking algorithm for the final results.
Issue Date:2015-07-16
Rights Information:Copyright 2015 Jiahui Jiang
Date Available in IDEALS:2015-09-29
Date Deposited:August 201

This item appears in the following Collection(s)

Item Statistics