Academic Commons

Presentations (Communicative Events)

Machine-Readable Dictionaries in Text-to-Speech Systems

Klavans, Judith L.; Tzoukermann, Evelyne

This paper presents the results of an experiment using machine-readable dictionaries (MRDs) and corpora for building concatenative units for text to speech (TTS) systems. Theoretical questions concerning the nature of phonemic data in dictionaries are raised; phonemic dictionary data is viewed as a representative corpus over which to extract n- gram phonemic frequencies in the language. Dictionary data are compared to corpus data, and phoneme inventories are evaluated for coverage. A methodology is defined to compute phonemic n-grams for incorporation into a TTS system.


  • thumnail for klavans_tzoukermann_94a.pdf klavans_tzoukermann_94a.pdf application/pdf 174 KB Download File

More About This Work

Academic Units
Computer Science
Proceedings of the Fifteenth International Conference on Computational Linguistics (COLING)
Published Here
April 26, 2013
Academic Commons provides global access to research and scholarship produced at Columbia University, Barnard College, Teachers College, Union Theological Seminary and Jewish Theological Seminary. Academic Commons is managed by the Columbia University Libraries.