2011 Articles
Classifying soundtracks with audio texture features
Sound textures may be defined as sounds whose character depends on statistical properties as much as the specific details of each individually-perceived event. Recent work has devised a set of statistics that, when synthetically imposed, allow listeners to identify a wide range of environmental sound textures. In this work, we investigate using these statistics for automatic classification of a set of environmental sound classes defined over a set of web videos depicting "multimedia events". We show that the texture statistics perform as well as our best conventional statistics (based on MFCC covariance). We further examine the relative contributions of the different statistics, showing the importance of modulation spectra and cross-band envelope correlations.
Subjects
Files
- EllisZM11-texture.pdf application/pdf 438 KB Download File
Also Published In
- Title
- 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing: Proceedings: May 22-27, 2011 Prague Congress Center, Prague, Czech Republic
- Publisher
- IEEE
- DOI
- https://doi.org/10.1109/ICASSP.2011.5947699
More About This Work
- Academic Units
- Electrical Engineering
- Published Here
- June 25, 2012