Speech & natural language publications

September 1, 2005

Meeting Structure Annotation: Data and Tools

By John Niekrasz

We present a set of annotations of hierarchical topic segmentations and action item sub-dialogues collected over 65 meetings from the ICSI and ISL meeting corpora, designed to support automatic meeting…

Publications, Speech & natural language publications
September 1, 2005

MLLR Transforms as Features in Speaker Recognition

We explore the use of adaptation transforms employed in speech recognition systems as features for speaker recognition. This approach is attractive because, unlike standard frame-based cepstral speaker recognition models, it…

September 1, 2005

Speech Translation for Low-Resource Languages: The Case of Pashto

By Kristin Precoda, Dimitra Vergyri, Andreas Kathol

We present a number of challenges and solutions that have arisen in the development of a speech translation system for American English and Pashto, highlighting those specific to a very…

September 1, 2005

Robust Feature Compensation in Nonstationary and Multiple Noise Environments

By Martin Graciarena, Horacio Franco, Victor Abrash

We extend the POF algorithm to allow a more accurate way to select noisy-to-clean feature mappings, by allowing different combinations of speech and noise to have combination-specific mappings selected depending…

September 1, 2005

Distinguishing Deceptive from Non-Deceptive Speech

By Andreas Kathol, Martin Graciarena

We present results from a study seeking to distinguish deceptive from non-deceptive speech using machine learning techniques on features extracted from a large corpus of deceptive and non-deceptive speech. We…

September 1, 2005

Comparing HMM, Maximum Entropy, and Conditional Random Fields for Disfluency Detection

We compare a generative hidden Markov model (HMM)-based approach and two conditional models — a maximum entropy (Maxent) model and a conditional random field (CRF) — for detecting disfluencies in…

September 1, 2005

Does Active Learning Help Automatic Dialog Act Tagging in Meeting Data?

We ask if active learning with lexical cues can help for this task and this domain. To better address this question, we explore active learning for two different types of…

September 1, 2005

Using MLP Features in SRI’s Conversational Speech Recognition System

We describe the development of a speech recognition system for conversational telephone speech (CTS) that incorporates acoustic features estimated by multilayer perceptrons (MLP). The acoustic features are based on frame-level…

July 1, 2005

Collaborative and argumentative models of natural discussions

By John Niekrasz

We report in this paper experiences and insights resulting from the first two years of work in two similar projects on meeting tracking and understanding. The projects are the DARPA-funded…

June 1, 2005

Using Conditional Random Fields for Sentence Boundary Detection in Speech

In this paper, we evaluate the use of a conditional random field (CRF) for this task and relate results with this model to our prior work. We evaluate across two…

April 1, 2005

Ontology-based multi-party meeting understanding

By John Niekrasz

This paper describes current and planned research efforts towards developing multimodal discourse understanding for an automated personal office assistant.

March 1, 2005

Structural Metadata Research in the EARS Program

In this paper we provide a brief overview of research on structural metadata extraction in the DARPA EARS rich transcription program. Tasks include detection of sentence boundaries, filler words, and…
