Academic Commons

Chapters (Layout Features)

Modeling the auditory organization of speech: a summary and some comments

Ellis, Daniel P. W.

The preceding three chapters have been concerned with the issues arising as a result of the inconvenient fact that our ears are rarely presented with the sound of a single speaker in isolation, but more often with a combination of several speech and nonspeech sounds which may also have been further altered by the acoustic environment. Faced with such a mixture, the listener evidently needs to consider each source separately, and this process of information segregation is known as auditory organization or auditory scene analysis (Bregman, 1990). Pure curiosity as well as the possibility of applications in automatic signal interpretation drive us to investigate auditory scene analysis through psychological experiments and computational modeling. Having sketched this framework and the current limits to our understanding of the process of auditory organization, we can now examine the material of each of the three chapters in more detail, seeing how it fits into this framework and also where the framework may be inadequate. Following these discussions, we will conclude with some remarks suggested by the particular combination of results in this section.

Subjects

Files

  • thumnail for Ellis98-modelspeech.pdf Ellis98-modelspeech.pdf application/x-pdf 64.9 KB Download File

Also Published In

Title
Listening to speech: an auditory perspective
Publisher
Lawrence Erlbaum

More About This Work

Academic Units
Electrical Engineering
Published Here
February 15, 2012
Academic Commons provides global access to research and scholarship produced at Columbia University, Barnard College, Teachers College, Union Theological Seminary and Jewish Theological Seminary. Academic Commons is managed by the Columbia University Libraries.