Files in this item

FilesDescriptionFormat

application/pdf

application/pdf3301186.pdf (3MB)Restricted to U of Illinois
(no description provided)PDF

Description

Title:Probabilistic Correspondence Mapping for Audiovisual Speaker Modeling
Author(s):Liu, Ming
Doctoral Committee Chair(s):Thomas Huang
Department / Program:Electrical and Computer Engineering
Discipline:Electrical and Computer Engineering
Degree Granting Institution:University of Illinois at Urbana-Champaign
Degree:Ph.D.
Genre:Dissertation
Subject(s):Engineering, Electronics and Electrical
Abstract:In addition to the framework of probabilistic correspondence mapping on audiovisual speaker modeling, we also explore the correspondence problems with different constraints. Frequency domain correspondence between speakers is established via dynamic programming for speaker normalization in speech recognition tasks. The adjacent constraints in frequency domain actually help to stabilize the algorithm, similar to the dynamic time warping techniques. We also explore the correspondence problem given the manifold structure of different pose face images. It turns out that the manifold structure is very useful to build a good correspondence across different subjects. For audiovisual fusion, a new fusion scheme factorizes audio and visual features into correlated and uncorrelated ones. The correlated features are considered to be the correspondence between two modalities.
Issue Date:2007
Type:Text
Language:English
Description:91 p.
Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2007.
URI:http://hdl.handle.net/2142/81062
Other Identifier(s):(MiAaPQ)AAI3301186
Date Available in IDEALS:2015-09-25
Date Deposited:2007


This item appears in the following Collection(s)

Item Statistics