Files in this item



application/pdf3301186.pdf (3MB)Restricted to U of Illinois
(no description provided)PDF


Title:Probabilistic Correspondence Mapping for Audiovisual Speaker Modeling
Author(s):Liu, Ming
Doctoral Committee Chair(s):Thomas Huang
Department / Program:Electrical and Computer Engineering
Discipline:Electrical and Computer Engineering
Degree Granting Institution:University of Illinois at Urbana-Champaign
Subject(s):Engineering, Electronics and Electrical
Abstract:In addition to the framework of probabilistic correspondence mapping on audiovisual speaker modeling, we also explore the correspondence problems with different constraints. Frequency domain correspondence between speakers is established via dynamic programming for speaker normalization in speech recognition tasks. The adjacent constraints in frequency domain actually help to stabilize the algorithm, similar to the dynamic time warping techniques. We also explore the correspondence problem given the manifold structure of different pose face images. It turns out that the manifold structure is very useful to build a good correspondence across different subjects. For audiovisual fusion, a new fusion scheme factorizes audio and visual features into correlated and uncorrelated ones. The correlated features are considered to be the correspondence between two modalities.
Issue Date:2007
Description:91 p.
Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2007.
Other Identifier(s):(MiAaPQ)AAI3301186
Date Available in IDEALS:2015-09-25
Date Deposited:2007

This item appears in the following Collection(s)

Item Statistics