Files in this item

FilesDescriptionFormat

application/pdf

application/pdf8823208.pdf (5MB)Restricted to U of Illinois
(no description provided)PDF

Description

Title:Storage Capacity of the Linear Associator: Beginnings of a Theory of Computational Memory
Author(s):Mumme, Dean C.
Doctoral Committee Chair(s):Schneider, Walter
Department / Program:Computer Science
Discipline:Computer Science
Degree Granting Institution:University of Illinois at Urbana-Champaign
Degree:Ph.D.
Genre:Dissertation
Subject(s):Computer Science
Abstract:This thesis presents a characterization of a simple connectionist-system, the linear-associator, as both a memory and a classifier. Toward this end, a theory of memory based on information-theory is devised. The principles of the information-theory of memory are then used in conjunction with the dynamics of the linear-associator to discern its storage capacity and classification capabilities as they scale with system size. To determine storage capacity, a set of M vector-pairs called "items" are stored in an associator with N connection-weights. The number of bits of information stored by the system is then determined to be about (N/2)log$\sb2$M. The maximum number of items storable is found to be half the number of weights so that the information capacity of the system is quantified to be (N/2)log$\sb2$N.
Classification capability is determined by allowing vectors not stored by the associator to appear at its input. Conditions necessary for the associator to make a correct response are derived from constraints of information theory and the geometry of the space of input-vectors. Results include derivation of the information-throughput of the associator, the amount of information that must be present in an input-vector and the number of vectors that can be classified by an associator of a given size with a given storage load.
Figures of merit are obtained that allow comparison of capabilities of general memory/classifier systems. For an associator with a simple non-linearity on its output, the merit figures are evaluated and shown to be suboptimal. Constant attention is devoted to relative parameter size required to obtain the derived performance characteristics. Large systems are shown to perform nearest the optimum performance limits and suggestions are made concerning system architecture needed for best results. Finally, avenues for extension of the theory to more general systems are indicated.
Issue Date:1988
Type:Text
Description:126 p.
Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1988.
URI:http://hdl.handle.net/2142/69597
Other Identifier(s):(UMI)AAI8823208
Date Available in IDEALS:2014-12-15
Date Deposited:1988


This item appears in the following Collection(s)

Item Statistics