Files in this item

FilesDescriptionFormat

application/pdf

application/pdfZHU-THESIS-2019.pdf (386kB)Restricted to U of Illinois
(no description provided)PDF

Description

Title:FUSE: Multi-faceted set expansion by coherent clustering of skip-grams
Author(s):Zhu, Wanzheng
Advisor(s):Han, Jiawei
Department / Program:Electrical & Computer Eng
Discipline:Electrical & Computer Engr
Degree Granting Institution:University of Illinois at Urbana-Champaign
Degree:M.S.
Genre:Thesis
Subject(s):Set Expansion, Word Sense Disambiguation
Abstract:Set expansion aims to expand a small set of seed entities into a complete set of relevant entities. Most existing approaches assume the input seed set is unambiguous and completely ignore the multi-faceted semantics of seed entities. As a result, given the seed set {"Canon", "Sony", "Nikon"}, previous methods return one mixed set of entities that are either camera brands or Japanese companies. In this thesis, we study the task of multi-faceted set expansion, which aims to capture all semantic facets in the seed set and returns multiple sets of entities, one for each semantic facet. We propose an unsupervised framework, FUSE, which consists of three major components: (1) facet discovery module: identifies all semantic facets of each seed entity by extracting and clustering its skip-grams, (2) facet fusion module: discovers shared semantic facets of the entire seed set by an optimization formulation, and (3) entity expansion module: expands each semantic facet by utilizing an iterative algorithm robust to skip-gram noise. Extensive experiments demonstrate that our algorithm, FUSE, can accurately identify multiple semantic facets of the seed set and generate quality entities for each facet.
Issue Date:2019-04-12
Type:Text
URI:http://hdl.handle.net/2142/105012
Rights Information:Copyright 2019 Wanzheng Zhu
Date Available in IDEALS:2019-08-23
Date Deposited:2019-05


This item appears in the following Collection(s)

Item Statistics