• Skip to primary navigation
  • Skip to main content
SRI logo
  • About
    • Press room
  • Expertise
    • Advanced imaging systems
    • Artificial intelligence
    • Biomedical R&D services
    • Biomedical sciences
    • Computer vision
    • Cyber & formal methods
    • Education and learning
    • Innovation strategy and policy
    • National security
    • Ocean & space
    • Quantum
    • QED-C
    • Robotics, sensors & devices
    • Speech & natural language
    • Video test & measurement
  • Ventures
  • NSIC
  • Careers
  • Contact
  • 日本支社
Search
Close
Home » Archives for Kristin Precoda
Kristin Precoda

Kristin Precoda

Go to bio page

Publications

Biomedical sciences publications May 1, 2015 Article

Classification of lexical stress using spectral and prosodic features for computer-assisted language learning systems

Harry Bratt, Colleen Richey, Horacio Franco, Victor Abrash, Kristin Precoda

We present a system for detection of lexical stress in English words spoken by English learners. This system was designed to be part of the EduSpeak® computer-assisted language learning (CALL) software. The system uses both prosodic and spectral features to detect the level of stress (unstressed, primary or secondary) for each syllable in a word. Features are computed on the vowels and include normalized energy, pitch, spectral tilt, and duration measurements, as well as log-posterior probabilities obtained from the frame-level mel-frequency cepstral coefficients (MFCCs). Gaussian mixture models (GMMs) are used to represent the distribution of these features for each stress class. The system is trained on utterances by L1-English children and tested on English speech from L1-English children and L1-Japanese children with variable levels of English proficiency. Since it is trained on data from L1-English speakers, the system can be used on English utterances spoken by speakers of any L1 without retraining. Furthermore, automatically determined stress patterns are used as the intended target; therefore, hand-labeling of training data is not required. This allows us to use a large amount of data for training the system. Our algorithm results in an error rate of approximately 11% on English utterances from L1-English speakers and 20% on English utterances from L1-Japanese speakers. We show that all features, both spectral and prosodic, are necessary for achievement of optimal performance on the data from L1-English speakers; MFCC log-posterior probability features are the single best set of features, followed by duration, energy, pitch and finally, spectral tilt features. For English utterances from L1-Japanese speakers, energy, MFCC log-posterior probabilities and duration are the most important features.

Speech & natural language publications May 1, 2014 Conference Paper

Lexical Stress Classification for Language Learning Using Spectral and Segmental Features

Victor Abrash, Kristin Precoda, Horacio Franco, Harry Bratt, Colleen Richey

We present a system for detecting lexical stress in English words spoken by English learners.  The system uses both spectral and segmental features to detect three levels of stress for each syllable in a word. 

Speech & natural language publications March 1, 2012

Detecting Leadership and Cohesion in Spoken Interactions

Kristin Precoda, Colleen Richey

We present a system for detecting leadership and group cohesion in multiparty dialogs and broadcast conversations in English and Mandarin.

Speech & natural language publications March 1, 2012 Conference Paper

Unsupervised topic modeling for leader detection in spoken discourse

Kristin Precoda

In this paper, we describe a method for leader detection in multiparty spoken discourse that relies on unsupervised topic modeling to segment the discourse automatically.

Speech & natural language publications June 1, 2011 Conference Paper

Detection of agreement and disagreement in broadcast conversations

Kristin Precoda, Colleen Richey

We present Conditional Random Fields based approaches for detecting agreement/disagreement between speakers in English broadcast conversation shows.

Information & computer science publications May 1, 2011 Conference Paper

Automatic identification of speaker role and agreement/disagreement in broadcast conversation

Kristin Precoda, Colleen Richey

We present supervised approaches for detecting speaker roles and agreement/disagreement between speakers in broadcast conversation shows in three languages: English, Arabic, and Mandarin.

Speech & natural language publications December 1, 2010 Conference Paper

Implementing SRI’s Pashto speech-to-speech translation system on a smartphone

Dimitra Vergyri, Kristin Precoda

We describe our recent effort implementing SRI’s UMPC-based Pashto speech-to-speech (S2S) translation system on a smart phone running the Android operating system.

Information & computer science publications July 1, 2010 Article

EduSpeak®: A Speech Recognition and Pronunciation Scoring Toolkit for Computer-Aided Language Learning Applications

Horacio Franco, Harry Bratt, Victor Abrash, Kristin Precoda

SRI International’s EduSpeak® system is a SDK that enables developers of interactive language education software to use state-of-the-art speech recognition and pronunciation scoring technology.

Speech & natural language publications April 1, 2009 Conference Paper

Recent advances in SRI’s IraqComm Iraqi Arabic-English speech-to-speech translation system

Horacio Franco, Andreas Kathol, Kristin Precoda, Colleen Richey, Dimitra Vergyri

We summarize recent progress on SRI’s IraqComm™ IraqiArabic-English two-way speech-to-speech translation system.

  • Go to page 1
  • Go to page 2
  • Go to Next Page »

How can we help?

Once you hit send…

We’ll match your inquiry to the person who can best help you.

Expect a response within 48 hours.

Career call to action image

Make your own mark.

Search jobs

Our work

Case studies

Publications

Timeline of innovation

Areas of expertise

Institute

Leadership

Press room

Media inquiries

Compliance

Careers

Job listings

Contact

SRI Ventures

Our locations

Headquarters

333 Ravenswood Ave
Menlo Park, CA 94025 USA

+1 (650) 859-2000

Subscribe to our newsletter


日本支社
SRI International
  • Contact us
  • Privacy Policy
  • Cookies
  • DMCA
  • Copyright © 2022 SRI International