Home

Sound Texture Modelling with Linear Prediction in both Time and Frequency Domains

Marios Athineos; Daniel P. W. Ellis

Title:
Sound Texture Modelling with Linear Prediction in both Time and Frequency Domains
Author(s):
Athineos, Marios
Ellis, Daniel P. W.
Date:
Type:
Articles
Department:
Electrical Engineering
Permanent URL:
Book/Journal Title:
2003 IEEE International Conference on Acoustics, Speech, and Signal Processing: Proceedings: April 6-10, 2003, Hong Kong Exhibition and Convention Centre, Hong Kong
Publisher:
IEEE
Publisher Location:
Piscataway, N.J.
Abstract:
Sound textures - for instance, a crackling fire, running water, or applause - constitute a large and largely neglected class of audio signals. Whereas tonal sounds have been effectively and flexibly modelled with sinusoids, aperiodic energy is usually modelled as white noise filtered to match the approximate spectrum of the original over 10-30 ms windows, which fails to provide a perceptually satisfying reproduction of many real-world noisy sound textures. We attribute this failure to the loss of short-term temporal structure, and we introduce a second modelling stage in which the time envelope of the residual from conventional linear predictive modelling is itself modelled with linear prediction in the spectral domain. This cascade time- and frequency-domain linear prediction (CTFLP) leads to noise-excited resyntheses that have high perceptual fidelity. We perform a novel quantitative error analysis by measuring the proportional error within time-frequency cells across a range of timescales.
Subject(s):
Electrical engineering
Publisher DOI:
http://dx.doi.org/10.1109/ICASSP.2003.1200054
Item views:
43
Metadata:
View

In Partnership with the Center for Digital Research and Scholarship at Columbia University Libraries/Information Services.