Speech & natural language publications
-
Speaker Recognition with Session Variability Normalization Based on MLLR Adaptation Transforms
We present a new modeling approach for speaker recognition that uses the maximum-likelihood linear regression (MLLR) adaptation transforms employed by a speech recognition system as features for support vector machine…
-
Detecting and Summarizing Action Items in Multi-Party Dialogue
This paper addresses the problem of identifying action items discussed in open-domain conversational speech, and does so in two stages: firstly, detecting the subdialogues in which action items are proposed,…
-
Resolving “You” in Multi-Party Dialog
This paper presents experiments into the resolution of “you” in multi-party dialog, dividing this process into two tasks: distinguishing between generic and referential uses; and then, for referential uses, identifying…
-
IraqComm: A Next Generation Translation System
This paper describes the IraqComm translation system that mediates and translates spontaneous conversations between an English speaker and a speaker of colloquial Iraqi Arabic.
-
Duration and Pronunciation Conditioned Lexical Modeling for Speaker Verification
We propose a method to improve speaker recognition lexical model performance using acoustic-prosodic information. More specifically, the lexical model is trained using duration- and pronunciation-conditioned word N-grams, simultaneously modeling lexical…
-
Detecting Deception Using Critical Segments
We present an investigation of segments that map to GLOBAL LIES, that is, the intent to deceive with respect to salient topics of the discourse. We propose that identifying the…
-
Integrating MAP, Marginals, and Unsupervised Language Model Adaptation
We investigate the integration of various language model adaptation approaches for a cross-genre adaptation task to improve Mandarin ASR system performance on a recently introduced new genre, broadcast conversation (BC).
-
fMPE-MAP: Improved Discriminative Adaptation for Modeling New Domains
This paper introduces a new adaptation approach, fMPE-MAP, which is an extension to the original fMPE (feature minimum phone error) algorithm, with the enhanced ability in porting Gaussian models and…
-
A Semi-Supervised Learning Approach for Morpheme Segmentation for an Arabic Dialect
We evaluate our approach by applying morpheme segmentation to the training data of a statistical machine translation (SMT) system. Experiments show that our approach is less sensitive to the availability…
-
A Smoothing Kernel for Spatially Related Features and Its Application to Speaker Verification
Most commonly used kernels are invariant to permutations of the feature vector components. We will consider one such case, where the features are spatially related and show a way to…
-
Advances in Mandarin Broadcast Speech Recognition
We describe our continuing efforts to improve the UW-SRI-ICSI Mandarin broadcast speech recognizer.
-
The SRI/OGI 2006 Spoken Term Detection System
This paper describes the system developed jointly at SRI and OGI for participation in the 2006 NIST Spoken Term Detection (STD) evaluation.