Speech & natural language publications

September 1, 1997

Mixture Input Transformations for Adaptation of Hybrid Connectionist Speech Recognizers

ByVictor Abrash

In this paper, we propose a new algorithm to train mixtures of transformation networks (MTNs) in the hybrid connectionist recognition framework. We apply the new algorithm to nonnative speaker adaptation,…

Publications, Speech & natural language publications
September 1, 1997

Automatic Pronunciation Scoring of Specific Phone Segments for Language Instruction

ByHoracio Franco

The aim of the work described in this paper is to develop methods for automatically assessing the pronunciation quality of specific phone segments uttered by students learning a foreign language.

Education & learning publications, Publications, Speech & natural language publications
September 1, 1997

Explicit Word Error Minimization in N-best List Rescoring

We show that the standard hypothesis scoring paradigm used in maximum-likelihood-based speech recognition systems is not optimal with regard to minimizing the word error rate, the commonly used performance metric…

Publications, Speech & natural language publications
April 1, 1997

Model Transformation for Robust Speaker Recognition from Telephone Data

In the context of automatic speaker recognition, we propose a model transformation technique that renders speaker models more robust to acoustic mismatches and to data scarcity by appropriately increasing their…

Publications, Speech & natural language publications
April 1, 1997

Neural-Network Based Measures of Confidence for Word Recognition

This paper proposes a probabilstic framework to define and evaluate confidence measures for word recognition. We describe a novel method to combine different knowledge sources and estimate the confidence in…

Publications, Speech & natural language publications
April 1, 1997

Handset-Dependent Background Models for Robust Text-Independent Speaker Recognition

This paper studies the effects of handset distortion on telephone-based speaker recognition performance. Results on the 1996 NIST Speaker Recognition Evaluation corpus show that using handset-matched background models reduces false…

Publications, Speech & natural language publications
April 1, 1997

Automatic Pronunciation Scoring for Language Instruction

ByHoracio Franco

In this paper we show that we can significantly improve HMM- based scores by using average phone segment posterior probabilities. Correlation between machine and human scores went up from r=0.50…

Publications, Speech & natural language publications
March 1, 1997

HTTP://WWW.SPEECH.SRI.COM/DEMOS/ATIS.HTML

This paper presents a speech-enabled WWW demonstration based on the Air Travel Information System (ATIS) domain. SRI’s speech recognition technology and natural language understanding are fully integrated in a Java…

Publications, Speech & natural language publications
February 1, 1997

Acoustic Modeling for the SRI Hub4 Partitioned Evaluation Continuous Speech Recognition System

We describe the development of the SRI system evaluated in the 1996 DARPA continuous speech recognition (CSR) Hub4 partitioned evaluation (PE). The task for the Hub4 evaluation was to recognition…

Publications, Speech & natural language publications
February 1, 1997

Hub4 Language Modeling Using Domain Interpolation and Data Clustering

In SRI's language modeling experiments for the Hub4 domain, three basic approaches were pursued: interpolating multiple models estimated from Hub4 and non-Hub4 training data, adapting the language model (LM) to…

Publications, Speech & natural language publications
January 1, 1997

A Speaker Identification Agent

This paper describes a prototype application which combines speaker identification technology and an agent architecture to provide user-definable monitors for incoming voicemail messages. Through a Web-distributable Java user interface, the…

Publications, Speech & natural language publications
October 1, 1996

Word Predictability After Hesitations: A Corpus-based Study

We ask whether lexical hesitations in spontaneous speech tend to precede words that are difficult to predict. We define predictability in terms of both transition probability and entropy, in the…

Publications, Speech & natural language publications