Files in this item

File: CHAN-THESIS-2017.pdf (833kB)
Format: PDF (application/pdf)

Description

Title: Probabilistic interpretation of path-based relevance in heterogeneous information networks
Author(s): Chan, Po-Wei
Advisor(s): Han, Jiawei
Department / Program: Computer Science
Discipline: Computer Science
Degree Granting Institution: University of Illinois at Urbana-Champaign
Degree: M.S.
Genre: Thesis
Subject(s): Relevance measure; Heterogeneous information network; Generative model
Abstract: As a powerful representation paradigm for networked and multi-typed data, the heterogeneous information network (HIN) is ubiquitous. Meanwhile, defining proper relevance measures has always been a fundamental problem of great pragmatic importance for network mining tasks. Inspired by the probabilistic interpretation of existing path-based relevance measures, we propose to study HIN relevance from a probabilistic perspective. We also identify cross-meta-path synergy in real-world data and propose to model it; this characteristic is important for defining path-based HIN relevance and has not been modeled by existing methods. A generative model is established to derive a novel path-based relevance measure, which is data-driven and tailored for each HIN. We develop an inference algorithm to find the maximum a posteriori (MAP) estimate of the model parameters, a task that entails non-trivial tricks. Experiments on two real-world datasets demonstrate the effectiveness of the proposed model and relevance measure.
Issue Date: 2017-04-24
Type: Text
URI: http://hdl.handle.net/2142/97416
Rights Information: Copyright 2017 Po-Wei Chan
Date Available in IDEALS: 2017-08-10
Date Deposited: 2017-05

