A framework for similarity-based clustering of biomedical citation graphs
Mohasel Arjomandi, Hossein
Loading…
Permalink
https://hdl.handle.net/2142/127150
Description
Title
A framework for similarity-based clustering of biomedical citation graphs
Author(s)
Mohasel Arjomandi, Hossein
Issue Date
2024-09-13
Director of Research (if dissertation) or Advisor (if thesis)
Chacko, George
Department of Study
Siebel School Comp & Data Sci
Discipline
Computer Science
Degree Granting Institution
University of Illinois at Urbana-Champaign
Degree Name
M.S.
Degree Level
Thesis
Keyword(s)
Clustering
Scientometrics
Community Detection
Biomedical Documents
Language
eng
Abstract
In many scientific fields, identifying groups of similar items within complex systems is crucial, often referred to as community detection in networks. This research focuses on community detection in citation graphs of PubMed documents. While existing methods have explored clustering citation graphs, they often neglect article metadata or rely on non-scalable solutions. In this thesis, I propose a framework for community detection in any citation graph within the PubMed database, utilizing both article metadata and citation graph structure. The core of this framework is a similarity metric that measures document relevance by incorporating metadata and citation importance weights. Scalability is achieved through a parallel framework for data gathering and feature construction, demonstrated by clustering a 7.5 million node graph. The approach is flexible, allowing users to adjust feature importance and content. The effectiveness of this method was evaluated using the Leiden CPM clustering algorithm across various feature combinations to assess their impact on cluster quality. Additionally, the approach’s potential for application beyond citation graphs, such as in social networks, is discussed. In summary, the developed similarity metric enhances community detection by integrating both topology and metadata, providing a foundation for future research in this area.
Use this login method if you
don't
have an
@illinois.edu
email address.
(Oops, I do have one)
IDEALS migrated to a new platform on June 23, 2022. If you created
your account prior to this date, you will have to reset your password
using the forgot-password link below.