Files in this item



application/pdfZhang, Hongshuo-Thesis.pdf (540kB)Restricted to U of Illinois
(no description provided)PDF


Title:Wikipedia entity retrieval with word and entity embedding space alignment
Author(s):Zhang, Hongshuo
Contributor(s):Chang, Kevin
Degree:B.S. (bachelor's)
Subject(s):Information Retrieval
Search Engine
Abstract:User queries used to search on Wikipedia might not always correspond to a Wikipedia article. Searching for such queries on the built-in search engine for Wikipedia does not return as many relevant Wikipedia articles as expected, since it is only text-based, and it overlooks the relationship between different Wikipedia articles. To improve the results of the Wikipedia search engine, with the help of the Wikipedia Link Graph, we propose a simple word – entity embedding space alignment model for searching relevant Wikipedia articles using fringe keywords from the Computer Science domain. By using this model, our problem can be reduced to finding the closest neighbors to a user query translated from the word space to the entity space. Due to a lack of dataset available for this purpose, a custom dataset constructed from Wikipedia2Vec is used to evaluate the model. Empirically, our word – entity alignment model does not always fair better than the text-based models.
Issue Date:2021-05
Genre:Dissertation / Thesis
Date Available in IDEALS:2021-08-11

This item appears in the following Collection(s)

Item Statistics