Files in this item



application/pdf3425399.pdf (5MB)Restricted to U of Illinois
(no description provided)PDF


Title:Generative Models for Retrieval of Video, Audio and Text Data
Author(s):Velivelli, Atulya
Doctoral Committee Chair(s):Huang, Thomas S.
Department / Program:Electrical and Computer Engineering
Discipline:Electrical and Computer Engineering
Degree Granting Institution:University of Illinois at Urbana-Champaign
Subject(s):Engineering, Electronics and Electrical
Abstract:We propose a general approach to audio segment retrieval, which allows a user to query audio data by an example audio segment of a short duration and to find similar segments. The basic idea of our approach is to first train a hidden Markov model (HMM) using the given example, it is called the theme HMM. The total audio data available is used to train a background HMM. We combine these individual HMMs to form a synthesized "background-theme-background" HMM. This synthesized HMM can then be applied to any audio stream as a parser to detect the most likely theme segment. A major advantage of this approach is that it does not assume any predefined segment boundaries as in previous work and thus can be expected to retrieve theme segments with more accurate boundaries. We overcome the problem of only being able to use a short duration query to train a theme HMM by using the maximum a posteriori estimator with the background model as a prior model. Evaluation of the proposed retrieval scheme using short duration example audio clips of narration as queries gives quite promising results.
Issue Date:2010
Description:120 p.
Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2010.
Other Identifier(s):(MiAaPQ)AAI3425399
Date Available in IDEALS:2015-09-25
Date Deposited:2010

This item appears in the following Collection(s)

Item Statistics