Using Prosodic Features of Speech and Audio Localization in Graphical User Interfaces

Alex Olwal; Steven K. Feiner

Title:
Using Prosodic Features of Speech and Audio Localization in Graphical User Interfaces
Author(s):
Olwal, Alex
Feiner, Steven K.
Date:
Type:
Technical reports
Department:
Computer Science
Permanent URL:
Series:
Columbia University Computer Science Technical Reports
Part Number:
CUCS-016-03
Publisher:
Department of Computer Science, Columbia University
Publisher Location:
New York
Abstract:
We describe several approaches for using prosodic features of speech and audio localization to control interactive applications. This information can be used for parameter control, as well as for disambiguating speech recognition. We discuss how characteristics of the spoken sentences can be exploited in the user interface; for example, by considering the speed with which the sentence was spoken and the presence of extraneous utterances. We also show how coarse audio localization can be used for low-fidelity gesture tracking, by inferring the speaker's head position.
Subject(s):
Computer science