Academic Commons


An EM Algorithm for Localizing Multiple Sound: Sources in Reverberant Environments

Mandel, Michael I.; Ellis, Daniel P. W.; Jebara, Tony

We present a method for localizing and separating sound sources in stereo recordings that is robust to reverberation and does not make any assumptions about the source statistics. The method consists of a probabilistic model of binaural multisource recordings and an expectation maximization algorithm for finding the maximum likelihood parameters of that model. These parameters include distributions over delays and assignments of time-frequency regions to sources. We evaluate this method against two comparable algorithms on simulations of simultaneous speech from two or three sources. Our method outperforms the others in anechoic conditions and performs as well as the better of the two in the presence of reverberation.


Also Published In

Advances in neural information processing systems 19
proceedings of the 2006 conference

More About This Work

Academic Units
Electrical Engineering
MIT Press
Published Here
June 27, 2012
Academic Commons provides global access to research and scholarship produced at Columbia University, Barnard College, Teachers College, Union Theological Seminary and Jewish Theological Seminary. Academic Commons is managed by the Columbia University Libraries.