Speech & natural language publications

August 1, 2007

Detecting Deception Using Critical Segments

We present an investigation of segments that map to GLOBAL LIES, that is, the intent to deceive with respect to salient topics of the discourse. We propose that identifying the…

Publications, Speech & natural language publications
August 1, 2007

Integrating MAP, Marginals, and Unsupervised Language Model Adaptation

We investigate the integration of various language model adaptation approaches for a cross-genre adaptation task to improve Mandarin ASR system performance on a recently introduced new genre, broadcast conversation (BC).

Publications, Speech & natural language publications
April 1, 2007

Parameterization of Prosodic Feature Distributions for SVM Modeling in Speaker Recognition

This paper explores the important question of finding a good kernel for a system that models syllable-based prosodic features using support vector machines (SVMs). We introduce two new methods for…

Publications, Speech & natural language publications
April 1, 2007

Statistical Sentence Extraction for Information Distillation

In this paper, we present a statistical sentence extraction approach for distillation. Basically, we frame this task as a classification problem, where each candidate sentence in documents is classified as…

Publications, Speech & natural language publications
April 1, 2007

NAP and WCCN: Comparison of Approaches Using MLLR-SVM Speaker Verification System

We compare two recently proposed techniques, within class covariance normalization (WCCN) [1] and nuisance attribute projection (NAP) [2], for intersession variability compensation in speaker verification.

Publications, Speech & natural language publications
April 1, 2007

Analysis of Morph-Based Speech Recognition and the Modeling of Out-of-Vocabulary Words Across Languages

We analyze subword-based language models (LMs) in large-vocabulary continuous speech recognition across four “morphologically rich” languages: Finnish, Estonian, Turkish, and Egyptian Colloquial Arabic.

Publications, Speech & natural language publications
April 1, 2007

Unsupervised Language Model Adaptation for Meeting Recognition

We present an application of unsupervised language model (LM) adaptation to meeting recognition, in a scenario where sequences of multiparty meetings on related topics are to be recognized, but no…

Publications, Speech & natural language publications
April 1, 2007

Combining Discriminative Feature, Transform, and Model Training for Large Vocabulary Speech Recognition

This paper uses a state-of-the-art Mandarin recognition system as a platform to study the interaction of three techniques. Experiments in the broadcast news and broadcast conversation domains show that the…

Publications, Speech & natural language publications
April 1, 2007

Noise Robust Speaker Identification for Spontaneous Arabic Speech

We present an approach that integrates multiple components and models for improved speaker identification in spontaneous Arabic speech in adverse acoustic conditions.

Publications, Speech & natural language publications
April 1, 2007

Speech Recognition as Feature Extraction for Speaker Recognition

We present specific techniques and results from SRI’s NIST speaker recognition evaluation system.

Publications, Speech & natural language publications
January 1, 2007

Significance of Joint Features Derived from the Modified Group Delay Function in Speech Processing

This paper investigates the significance of combining cepstral features derived from the modified group delay function and from the short-time spectral magnitude like the MFCC.

Publications, Speech & natural language publications
January 1, 2007

The ICSI-SRI Spring 2006 Meeting Recognition System

We describe the development of the ICSI-SRI speech recognition system for the NIST Spring 2006 Meeting Rich Transcription (RT-06S) evaluation, highlighting improvements, including the delay-and-sum algorithm, the nearfield segmenter, language…

Publications, Speech & natural language publications