Detecting Music in Ambient Audio by Long-Window Autocorrelation

Lee, Keansub; Ellis, Daniel P. W.

We address the problem of detecting music in the background of ambient real-world audio recordings such as the sound track of consumer-shot video. Such material may contain high levels of noises, and we seek to devise features that will reveal music content in such circumstances. Sustained, steady musical pitches show significant, structured autocorrelation at when calculated over windows of hundreds of milliseconds, where autocorrelation of aperiodic noise has become negligible at higher-lag points if a signal is whitened by LPC. Using such features, further compensated by their long-term average to remove the effect of stationary periodic noise, we produce GMM and SVM based classifiers with high performance compared with previous approaches, as verified on a corpus of real consumer video.


Also Published In

2008 IEEE International Conference on Acoustics, Speech, and Signal Processing: ICASSP '08: Proceedings: March 30-April 4, 2008 Caesars Palace Las Vegas, Nevada, U.S.A.

More About This Work

Academic Units
Electrical Engineering
Published Here
June 27, 2012