Academic Commons

Articles

Detecting Music in Ambient Audio by Long-Window Autocorrelation

Lee, Keansub; Ellis, Daniel P. W.

We address the problem of detecting music in the background of ambient real-world audio recordings such as the sound track of consumer-shot video. Such material may contain high levels of noises, and we seek to devise features that will reveal music content in such circumstances. Sustained, steady musical pitches show significant, structured autocorrelation at when calculated over windows of hundreds of milliseconds, where autocorrelation of aperiodic noise has become negligible at higher-lag points if a signal is whitened by LPC. Using such features, further compensated by their long-term average to remove the effect of stationary periodic noise, we produce GMM and SVM based classifiers with high performance compared with previous approaches, as verified on a corpus of real consumer video.

Files

Also Published In

Title
2008 IEEE International Conference on Acoustics, Speech, and Signal Processing: ICASSP '08: Proceedings: March 30-April 4, 2008 Caesars Palace Las Vegas, Nevada, U.S.A.
DOI
https://doi.org/10.1109/ICASSP.2008.4517533

More About This Work

Academic Units
Electrical Engineering
Publisher
IEEE
Published Here
June 27, 2012
Academic Commons provides global access to research and scholarship produced at Columbia University, Barnard College, Teachers College, Union Theological Seminary and Jewish Theological Seminary. Academic Commons is managed by the Columbia University Libraries.