Files in this item



application/pdfMert_Bay.pdf (1MB)
(no description provided)PDF


Title:Methods for multiple pitch tracking and instrument separation from monaural polyphonic recordings
Author(s):Bay, Mert
Director of Research:Beauchamp, James W.
Doctoral Committee Chair(s):Beauchamp, James W.
Doctoral Committee Member(s):Downie, J. Stephen; Hasegawa-Johnson, Mark A.; Loui, Michael C.; Smaragdis, Paris
Department / Program:Electrical & Computer Eng
Discipline:Electrical & Computer Engr
Degree Granting Institution:University of Illinois at Urbana-Champaign
Subject(s):Signal processing
Machine learning
Music information retrieval
Music signal processing
Abstract:Recently there has been a greater need to analyze, summarize, and categorize the increasing amount of audio content in the world. Most of this content comes from polyphonic music as mixtures of audio sources. Recently there has been much interest in the analysis of polyphonic music. Analysis results can be in the form of source tracking, where instrument pitch tracks and their weights are estimated from a sound mixture throughout time, or they would be in the form of source separation where individual sources are extracted from the mixture. Both problems are addressed in this dissertation. The main problem in the analysis of audio mixtures results from multiple source harmonic frequencies frequently overlapping with each other. Although audio sources are non-stationary, their spectra have a considerable amount of structure that can differentiate them from other sources. Recently non-negative matrix factorization (NMF) and probabilistic latent component analysis (PLCA) have been used by many researchers for the analysis of polyphonic audio. They provide good representations of audio mixtures as sums of individual sources. To solve the multiple instrument tracking problem, a hierarchical probabilistic model is proposed as an extension of probabilistic latent component analysis to include parameter estimation of basis spectra and their relative weights for each instrument and their pitches. A pitch-informed NMF based method is proposed to resolve overlapping harmonics in source separation problems. Both methods were trained in advance on example spectra from similar instruments. Both methods were tested on standard datasets, and they were found to outperform several prior unsupervised state-of-the-art methods addressing similar problems.
Issue Date:2013-02-03
Rights Information:Copyright 2012 Mert Bay
Date Available in IDEALS:2013-02-03
Date Deposited:2012-12

This item appears in the following Collection(s)

Item Statistics