April 1, 2003

Prosodic Knowledge Sources for Automatic Speech Recognition

Citation

Vergyri, D., Stolcke, A., Gadde, V. R. R., Ferrer, L., & Shriberg, E. (2003, April). Prosodic knowledge sources for automatic speech recognition. In Proceedings of the 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '03) (Vol. 1, pp. I-I). IEEE.

Abstract

In this work, different prosodic knowledge sources are integrated into a state-of-the-art large vocabulary speech recognition system. Prosody manifests itself on different levels in the speech signal: within words, as changes in phone durations and pitch; between words, as variation in pause length; and beyond words, correlating with higher linguistic structures and nonlexical phenomena. We investigate three models, each exploiting a different level of prosodic information, in rescoring N-best hypotheses according to how well the recognized words correspond to prosodic features of the utterance. Experiments on the Switchboard corpus show word accuracy improvements with each prosodic knowledge source. A further improvement is observed with the combination of all models, demonstrating that they each capture somewhat different prosodic characteristics of the speech signal.
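
The N-best rescoring idea described in the abstract can be illustrated with a minimal sketch. It assumes that each hypothesis carries log-domain scores from several knowledge sources and that these are combined as a weighted sum (a log-linear combination); the abstract does not specify the combination method, so this is an assumption. The class `Hypothesis`, the functions `combined_score` and `rescore_nbest`, the source names, the weights, and all score values are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Hypothesis:
    """One entry of an N-best list (hypothetical structure)."""
    words: List[str]
    # Log-domain scores keyed by knowledge-source name (made-up values below).
    scores: Dict[str, float]


def combined_score(hyp: Hypothesis, weights: Dict[str, float]) -> float:
    """Weighted sum of the log scores for the sources named in `weights`."""
    return sum(w * hyp.scores[name] for name, w in weights.items())


def rescore_nbest(nbest: List[Hypothesis], weights: Dict[str, float]) -> Hypothesis:
    """Return the hypothesis with the highest combined score."""
    return max(nbest, key=lambda h: combined_score(h, weights))


# Toy two-entry N-best list; the numbers are illustrative, not experimental data.
nbest = [
    Hypothesis(["i", "see"], {"acoustic": -10.0, "lm": -5.0, "prosody": -2.0}),
    Hypothesis(["icy"], {"acoustic": -9.5, "lm": -5.0, "prosody": -6.0}),
]

# Baseline: acoustic and language model scores only.
baseline = rescore_nbest(nbest, {"acoustic": 1.0, "lm": 1.0})

# Adding a prosodic score with an assumed weight can change the top choice.
with_prosody = rescore_nbest(nbest, {"acoustic": 1.0, "lm": 1.0, "prosody": 0.5})

print(" ".join(baseline.words))
print(" ".join(with_prosody.words))
```

In this toy list the second hypothesis wins on acoustic and language model scores alone, while the prosodic score tips the decision toward the first. In practice the weights would be tuned on held-out data; the values here are placeholders.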


