Browse by Subject "text mining"
(2013-08-22) Sustainability indicators are metrics that are used to assess and track sustainable development, such as the number of people living in poverty or the conservation status of endangered species. Defining sustainability ...
(2010-02-03) Automatically identifying semantic relationships from text plays an important role in knowledge discovery, for example to connect a researcher in one discipline to related research questions in a second discipline in ...
(iSchools, 2013-02) This study presents the idea and design of a method to characterize the scholar h-index by full-text citation analysis. The method combines citation analysis and text mining to modify the oversimplified process of ...
(2010-05-19) Discovery Driven Analysis (DDA) is a common feature of OLAP technology for analyzing structured data. In essence, DDA helps analysts discover anomalous data by highlighting 'unexpected' values in the OLAP ...
(2012-05-22) With the development of Web 2.0, a huge amount of user-generated data on social media sites is attracting attention from different research areas. Social media data has heterogeneous data types including link, text and ...
Finding the Canary in Text Mining: Analysis of the Use and Users of MONK Text Mining Research Software (2010-11-21) MONK is a text mining research software tool hosted by the University of Illinois Library that enables humanities scholars to mine data from digitized texts in select literary databases and archives. This poster presents ...
(2013-09-15) A list of stop words used by Andrew Goldstone and Ted Underwood to topic model a set of seven scholarly journals in literary studies. Text file with one word per line.
Text file (text/plain, 41kB)
(2012-10-04) Presentation given by Ted Underwood at DHS 2012 on text mining and topic modeling.
Microsoft PowerPoint 2007 (application/vnd.openxmlformats-officedocument.presentationml.presentation, 619kB)
(2015-01-21) The “big data” era is characterized by an explosion of information in the form of digital data collections, ranging from scientific knowledge, to social media, news, and everyone’s daily life. Valuable knowledge about ...
(2015-04-15) Non-native speakers of English far outnumber native speakers; English is the main language of books, newspapers, airports, air-traffic control, international business, academic conferences, science, technology, diplomacy, ...
(iSchools, 2014-03-01) This poster proposes the use of Named Entity Recognition as a heuristic tool for improving manual document classification. This technique was developed as part of a project studying collaborative work via the acknowledgment ...