Academic Commons

Articles

Detailed graphical models for source separation and missing data interpolation in audio

Reyes-Gomez, Manuel; Jojic, Nebojsa; Ellis, Daniel P. W.

Methods for blind source separation based only on general properties such as source independence encounter difficulties when the degree of overlap and/or the dimensionality of the observations make the blind inference problem unresolvable. Such situations require additional constraints on the form of the individual sources, motivating the development of models able to capture in detail the consistency and variability of a single source's sound. With single-chain HMMs this requires a very large number of states, making the models marginally practical. Here we present two approaches to factorize the variability in detailed models. First is a coupled subband model, where each source signal is broken into multiple frequency bands, and separate but coupled HMMs are built for each band requiring many fewer states per model. In order to avoid the unnatural state combinations that would arise from independent models for each band, we couple adjacent bands resulting in a grid-like model for the full spectrum. Exact inference of such a model is intractable, but we have derived an efficient approximation based on variational methods.

Files

  • thumnail for snowbird04-transform.pdf snowbird04-transform.pdf application/x-pdf 1.17 MB Download File

Also Published In

Title
The Learning Workshop, Cliff Lodge, Snowbird, Utah, April 3-6, 2012

More About This Work

Academic Units
Electrical Engineering
Publisher
Computational and Biological Learning Society
Published Here
June 29, 2012
Academic Commons provides global access to research and scholarship produced at Columbia University, Barnard College, Teachers College, Union Theological Seminary and Jewish Theological Seminary. Academic Commons is managed by the Columbia University Libraries.