2011 Articles
Evaluating music sequence models through missing data
Building models of the structure in musical signals raises the question of how to evaluate and compare different modeling approaches. One possibility is to use the model to impute deliberately removed patches of missing data, then to compare the model's predictions with the part that was removed. We analyze a corpus of popular music audio represented as beat-synchronous chroma features, and compare imputation based on simple linear prediction to more complex models including nearest neighbor selection and shift-invariant probabilistic latent component analysis. Simple linear models perform best according to Euclidean distance, despite producing stationary results which are not musically meaningful. We therefore investigate alternate evaluation measures and observe that an entropy difference metric correlates better with our expectations for musically consistent reconstructions. Under this measure, the best-performing imputation algorithm reconstructs masked sections by choosing the nearest neighbor to the surrounding observations within the song. This result is consistent with the large amount of repetition found in pop music.
Subjects
Files
- BertMGWE11-musseq.pdf application/pdf 204 KB Download File
Also Published In
- Title
- 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing: Proceedings: May 22-27, 2011 Prague Congress Center, Prague, Czech Republic
- Publisher
- IEEE
- DOI
- https://doi.org/10.1109/ICASSP.2011.5946369
More About This Work
- Academic Units
- Electrical Engineering
- Published Here
- June 25, 2012