Presentations (Communicative Events)

Modeling Prosody Automatically in Concept-to-Speech Generation

Pan, Shimei

A Concept-to-Speech (CTS) Generator is a system which integrates language generation with speech synthesis and produces speech from semantic representations. This is in contrast to Text-to-Speech (TTS) systems where speech is produced from text. CTS systems have an advantage over TTS because of the availability of semantic and pragmatic information, which are considered crucial for prosody generation, a process which models the variations in pitch, tempo and rhythm. My goal is to build a CTS system which produces more natural and intelligible speech than TTS. The CTS system is being developed as part of MAGIC (Dalal et al. 1996), a multimedia presentation generation system for health-care domain.

Files

More About This Work

Academic Units
Computer Science
Publisher
Proceedings of AAAI'99
Published Here
May 3, 2013