Academic Commons

Using Mutual Information to design class-specific phone recognizers

Scanlon, Patricia; Ellis, Daniel P. W.; Reilly, Richard

Information about the identity of subword units such as phones cannot easily be pinpointed because it is broadly distributed in time and frequency. Continuing earlier work, we use Mutual Information as a measure of the usefulness of individual time-frequency cells for various speech classification tasks, using the hand-annotations of the TIMIT database as our ground truth. Because broad phonetic classes such as vowels and stops have very different temporal characteristics, we examine mutual information separately for each class, revealing structure that was not uncovered in earlier work. Further structure is revealed by aligning the time-frequency display of each phone at the center of its hand-marked segment, rather than averaging across all possible alignments within each segment. Based on these results, we evaluate a range of vowel classifiers over the TIMIT test set and show that selecting input features according to the mutual information criterion can provide a significant increase in classification accuracy.
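
The idea of scoring each time-frequency cell by its mutual information with the phone label, then keeping the highest-scoring cells as classifier inputs, can be illustrated with a minimal sketch. This is not the authors' implementation: the equal-occupancy histogram estimator, the default of 8 bins, the nested-list layout of the patches, and the function names `quantize`, `mutual_information`, `mi_map`, and `select_cells` are all assumptions made for illustration.

```python
import math
from collections import Counter


def quantize(values, n_bins=8):
    """Map continuous values to bin indices 0..n_bins-1 (assumed estimator)."""
    ranked = sorted(values)
    # Interior bin edges taken at evenly spaced ranks (equal-occupancy bins).
    edges = [ranked[(len(ranked) * b) // n_bins] for b in range(1, n_bins)]
    # The bin index is the number of edges the value reaches or exceeds.
    return [sum(v >= e for e in edges) for v in values]


def mutual_information(feature, labels, n_bins=8):
    """Histogram estimate of I(feature; label) in bits."""
    bins = quantize(feature, n_bins)
    n = len(labels)
    joint = Counter(zip(bins, labels))
    bin_counts = Counter(bins)
    label_counts = Counter(labels)
    mi = 0.0
    for (b, y), count in joint.items():
        # p(b, y) * log2( p(b, y) / (p(b) * p(y)) ), written with raw counts.
        mi += (count / n) * math.log2(count * n / (bin_counts[b] * label_counts[y]))
    return mi


def mi_map(patches, labels, n_bins=8):
    """patches[i][f][t] is cell (f, t) of example i; returns MI per cell."""
    n_freq, n_time = len(patches[0]), len(patches[0][0])
    return [
        [mutual_information([p[f][t] for p in patches], labels, n_bins)
         for t in range(n_time)]
        for f in range(n_freq)
    ]


def select_cells(mi, k):
    """Return the k (freq, time) index pairs with the highest MI."""
    cells = [(f, t) for f in range(len(mi)) for t in range(len(mi[0]))]
    return sorted(cells, key=lambda c: mi[c[0]][c[1]], reverse=True)[:k]
```

In this sketch, each example would be a fixed-size time-frequency patch centered on a phone segment, and computing `mi_map` separately on the examples of one broad class (for instance, vowels only) would give a class-specific map. Note that ranking cells individually ignores redundancy between neighboring cells, which a more careful selection scheme might account for.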

Also Published In

Title
8th European Conference on Speech Communication and Technology: Eurospeech '03: An Interspeech Event: September 1 - 4, 2003, Geneva, Switzerland

More About This Work

Academic Units
Electrical Engineering
Publisher
ISCA
Published Here
June 29, 2012