Publications
-
MLLR Transforms as Features in Speaker Recognition
We explore the use of adaptation transforms employed in speech recognition systems as features for speaker recognition. This approach is attractive because, unlike standard frame-based cepstral speaker recognition models, it…
-
Robust Feature Compensation in Nonstationary and Multiple Noise Environments
We extend the POF algorithm to select noisy-to-clean feature mappings more accurately, by allowing different combinations of speech and noise to have combination-specific mappings selected depending…
-
Two Experiments Comparing Reading with Listening for Human Processing of Conversational Telephone Speech
We report on results of two experiments designed to compare subjects’ ability to extract information from audio recordings of conversational telephone speech (CTS) with their ability to extract information from…
-
Development of a Conversational Telephone Speech Recognizer for Levantine Arabic
In this paper, we describe the development of a large-vocabulary speech recognition system for Levantine Arabic, which was a new dialectal recognition task for our existing system. We discuss the…
-
Meeting Structure Annotation: Data and Tools
We present a set of annotations of hierarchical topic segmentations and action item sub-dialogues collected over 65 meetings from the ICSI and ISL meeting corpora, designed to support automatic meeting…
-
Spoken Language Understanding
Spoken language understanding (SLU) systems contain an automatic speech recognition (ASR) component and must be robust to noise due to the spontaneous nature of spoken language and the errors introduced by ASR. SLU…
-
Does Active Learning Help Automatic Dialog Act Tagging in Meeting Data?
We ask whether active learning with lexical cues can help with this task in this domain. To better address this question, we explore active learning for two different types of…
-
Pushing the Envelope — Aside
Despite successes, there are still significant limitations to speech recognition performance. For this reason, authors have proposed methods that incorporate different (and larger) analysis windows, which are described in this…
-
A Personalized Time Management Assistant: Research Directions
This paper presents ongoing work to build the Personalized Time Manager (PTIME) system, a persistent assistant that builds on our previous work on a personalized calendar agent (PCalM) (Berry et…
-
A Robust Method for Tracking Scene Text in Video Imagery
We describe an approach that tracks planar regions of scene text that can undergo arbitrary 3-D rigid motion and scale changes. Our approach computes homographies on blocks of contiguous frames…
-
Collaborative and Argumentative Models of Natural Discussions
We report in this paper experiences and insights resulting from the first two years of work in two similar projects on meeting tracking and understanding. The projects are the DARPA-funded…
-
Identifying and Segmenting Human-Motion for Mobile Robot Navigation Using Alignment Errors
This paper presents a new human-motion identification and segmentation algorithm from moving cameras. The algorithm is based on alignment error between pairs of moving object images. Pairs of object images…