Files in this item



application/pdf8409749.pdf (7MB)Restricted to U of Illinois
(no description provided)PDF


Title:Content Duplication Among Documents on Specific Subjects
Author(s):Blue, Richard Irving
Department / Program:Library Science
Discipline:Library Science
Degree Granting Institution:University of Illinois at Urbana-Champaign
Subject(s):Information Science
Abstract:An attempt was made to see how duplication of textual content among documents on the same specific medical subjects varied with time. Text analysis was performed on sets of documents that had been located with MEDLINE searches. Duplication of sentence segments between two documents was assumed to indicate duplication of content. More duplication of sentence segments was found for current documents than for older ones.
Use of sentence segment duplication as a measure of document overlap was also compared to use of interdocument duplication of index terms (a method described earlier by Cleverdon and Kidd) as a measure of document overlap. A greater range of variability was found in the text overlap measure, which may indicate that textual content duplication is the more discerning of the two measures.
Although the results were inconclusive in many ways, the study at least tested an alternative methodology for measuring content overlap and prepared the way for larger scale studies.
Issue Date:1984
Description:247 p.
Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1984.
Other Identifier(s):(UMI)AAI8409749
Date Available in IDEALS:2014-12-15
Date Deposited:1984

This item appears in the following Collection(s)

Item Statistics