Files in this item



application/pdf: UIUCDCS-R-2009-3043.pdf (553kB)


Title: Effective Ranking of XML Keyword Search Results
Author(s): Termehchy, Arash; Winslett, Marianne
Subject(s): Computer Science
Abstract: The popularity of XML has exacerbated the need for an easy-to-use, high-precision query interface for XML data. When traditional document-oriented keyword search techniques do not suffice, natural language interfaces and keyword search techniques that take advantage of XML structure make it very easy for ordinary users to query XML databases. Unfortunately, current approaches to processing these queries rely heavily on heuristics that are intuitively appealing but ultimately ad hoc. These approaches often retrieve false positive answers, overlook correct answers, and cannot rank answers appropriately. To address these problems for data-centric XML, we propose coherency ranking, a domain- and database design-independent ranking method for XML keyword queries that is based on an extension of the concepts of data dependencies and mutual information. With coherency ranking, the results of a keyword query are invariant under schema reorganization. We analyze the way in which previous approaches to XML keyword search approximate coherency ranking, and present efficient algorithms to process queries and rank their answers using coherency ranking. Our empirical evaluation with two real-world XML data sets shows that coherency ranking has better precision and recall and provides better ranking than all previous approaches. Coherency ranking can also be used for keyword queries over relational and graph data.
Issue Date: 2009-03
Date Available in IDEALS: 2009-04-15
