Files in this item

application/pdf: 8409916.pdf (6MB). Restricted to U of Illinois.

Title: Page Indexing for Textual Information Retrieval Systems
Author(s): Emrath, Perry Alan
Department / Program: Computer Science
Discipline: Computer Science
Degree Granting Institution: University of Illinois at Urbana-Champaign
Subject(s): Computer Science
Abstract: A number of applications exist for systems which can store and interactively retrieve from very large natural language textual databases. This thesis discusses conventional approaches to the design of such systems. The notion of page indexing is introduced as a new scheme for doing information retrieval from natural language full-text databases.
The structure of a page indexed database is described and the algorithms needed to do retrieval using the page index are presented. Some characteristics of page indexed text are analyzed and measured in order to estimate the size of the page index, and to show how the size of the index is related to the page size. One of the advantages of the page indexing scheme is the ease with which such a system can be analyzed. This analysis is based on characteristics of the hardware used to implement the system and on characteristics of queries. Finally, three hypothetical systems are proposed and analyzed using the techniques and methodologies developed in this thesis. These systems range from a microprocessor for a database of 250 megabytes to a large computer system employing multiple special purpose processors for a database of 50 gigabytes.
Issue Date: 1983
Description: 150 p.
Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1983.
Other Identifier(s): (UMI)AAI8409916
Date Available in IDEALS: 2014-12-15
Date Deposited: 1983