Academic Commons

Chapters (Layout Features)

An Introduction to Signal Processing for Speech

Ellis, Daniel P. W.

The formal tools of signal processing emerged in the mid twentieth century when electronics gave us the ability to manipulate signals — time-varying measurements — to extract or rearrange various aspects of interest to us, i.e., the information in the signal. The core of traditional signal processing is a way of looking at the signals in terms of sinusoidal components of differing frequencies (the Fourier domain), and a set of techniques for modifying signals that are most naturally described in that domain, i.e., filtering. Although originally developed using analog electronics, since the 1970s signal processing has more and more been implemented on computers in the digital domain, leading to some modifications to the theory without changing its essential character. This chapter aims to give a transparent and intuitive introduction to the basic ideas of the Fourier domain and filtering, and connects them to some of the common representations used in speech science, including the spectrogram and cepstral coefficients. We assume the absolute minimum of prior technical background, which will naturally be below the level of many readers; however, there may be some value in taking such a ground-up approach even for those for whom much of the material is review.

Files

Also Published In

Title
The Handbook of Phonetic Sciences
Publisher
Wiley-Blackwell
DOI
https://doi.org/10.1002/9781444317251.ch20

More About This Work

Academic Units
Electrical Engineering
Published Here
February 15, 2012
Academic Commons provides global access to research and scholarship produced at Columbia University, Barnard College, Teachers College, Union Theological Seminary and Jewish Theological Seminary. Academic Commons is managed by the Columbia University Libraries.