Academic Commons

Presentations (Communicative Events)

Discriminative Phonotactics for Dialect Recognition Using Context-Dependent Phone Classifiers

Biadsy, Fadi; Soltau, Hagen; Mangu, Lidia; Navratil, Jiri; Hirschberg, Julia Bell

In this paper, we introduce a new approach to dialect recognition that relies on context-dependent (CD) phonetic differences between dialects as well as phonotactics. Given a speech utterance, we obtain the phone sequence using a CD-phone recognizer. We then identify the most likely dialect of these CD-phones using SVM classifiers. Augmenting these phones with the output of these classifiers, we extract augmented phonotactic features which are subsequently given to a logistic regression classifier to obtain a dialect detection score. We test our approach on the task of detecting four Arabic dialects from 30s utterances. We compare our performance to two baselines, PRLM and GMM-UBM, as well as to our own improved version of GMM-UBM which employs fMLLR adaptation. Our approach performs significantly better than all three baselines at 5% absolute Equal Error Rate (EER). The overall EER of our system is 6%.

Files

  • thumnail for biadsy_et_al_odyssey10.pdf biadsy_et_al_odyssey10.pdf application/pdf 797 KB Download File

More About This Work

Academic Units
Computer Science
Publisher
Odyssey 2010, The Speaker and Language Recognition Workshop
Published Here
August 5, 2013

Notes

Presentation slides are available at http://hdl.handle.net/10022/AC:P:21232

Academic Commons provides global access to research and scholarship produced at Columbia University, Barnard College, Teachers College, Union Theological Seminary and Jewish Theological Seminary. Academic Commons is managed by the Columbia University Libraries.