January 1, 2002

Prosody Modeling for Automatic Speech Recognition and Understanding

Citation

Shriberg, E., & Stolcke, A. (2004). Prosody modeling for automatic speech recognition and understanding. In Mathematical Foundations of Speech and Language Processing (pp. 105-114). Springer, New York, NY.

Abstract

This paper summarizes statistical modeling approaches for the use of prosody (the rhythm and melody of speech) in automatic recognition and understanding of speech. We outline effective prosodic feature extraction, model architectures, and techniques to combine prosodic with lexical (word-based) information. We then survey a number of applications of the framework, and give results for automatic sentence segmentation and disfluency detection, topic segmentation, dialog act labeling, and word recognition.

Keywords: Prosody, speech recognition and understanding, hidden Markov models.
