Files in this item
|(no description provided)||PNG image|
|Title:||UIUC University Library Dewey print collection: Analysis by subject headings|
Dewey Decimal Classification
As of January 2018, the UIUC University Library print collection catalog contained over 10.7 million records. In order to discover the dominating subjects in the collection, the subject headings were analyzed. At the UIUC University Library, one of the classifications that is used to arrange print titles on the shelves by topic is the Dewey Decimal Classification. For the current analysis of the Library print collection, the items with Dewey call numbers were selected and subject headings were analyzed going down to subclasses of ten Dewey main subject classes. Constraints: Items whose call numbers are assigned according to the Dewey DC, but start with prefixes, were not included in the analysis. Prefixes denote subcategories of materials such as oversized.
The dataset consists of data pulled from the Voyager as of January 2018. Originally, data did not have titles’ subject heading information, therefore, subject heading (class/subclass) was assigned to a title according to its call number. For preprocessing and data cleaning, MS SQL database, MS SQL Server Management Studio, SQL, and regular expressions were utilized. For data visualization and analysis, Tableau was used. The total number of items in the Dewey Print Collection in the analysis is 6,253,928.
|Rights Information:||Copyright 2018 Vera Vasileva|
|Date Available in IDEALS:||2019-01-30|