Harry Bratt

June 1, 2003

Iterative Statistical Language Model Generation for Use with an Agent-Oriented Natural Language Interface

We describe a method for developing a statistical language model (SLM) with high keyword-spotting accuracy for a natural language interface (NLI). The NLI is based on the Adaptive Agent Oriented Software Architecture (AAOSA).

August 1, 2000

The SRI EduSpeak(TM) System: Recognition and Pronunciation Scoring for Language Learning

The EduSpeak(TM) system is a software development toolkit that enables developers of interactive language education software to use state-of-the-art speech recognition and pronunciation scoring technology.

May 1, 2000

The SRI March 2000 Hub-5 Conversational Speech Transcription System

We describe SRI’s large-vocabulary conversational speech recognition system as used in the March 2000 NIST Hub-5E evaluation.

August 1, 1998

Collection and Detailed Transcription of a Speech Database for Development of Language Learning Technologies

We describe the methodologies for collecting and annotating a Latin-American Spanish speech database. We use the annotated database to investigate rater reliability, the effect of each phone on overall perceived nonnativeness, and the frequency of specific pronunciation errors.

September 1, 1997

HMM State Clustering Across Allophone Class Boundaries

We present a novel approach to hidden Markov model (HMM) state clustering based on the use of broad phone classes and an allophone class entropy measure. Our algorithm allows clustering across allophone class boundaries by defining broad phone groups within which two states from different allophone classes can be clustered together.