Conference Paper May 1, 2015

A cross-language study on automatic speech disfluency detection

We investigate two systems for automatic disfluency detection on English and Mandarin conversational speech data. The first system combines various lexical and prosodic features in a Conditional Random Field model...

Conference Paper May 1, 2014

Highly Accurate Phonetic Segmentation Using Boundary Correction Models and System Fusion

Accurate phone-level segmentation of speech remains an important task for many subfields of speech research. We investigate techniques for boosting the accuracy of automatic phonetic segmentation based on HMM acoustic-phonetic...

Conference Paper August 1, 2013

Automatic phonetic segmentation using boundary models

This study attempts to improve automatic phonetic segmentation within the HMM framework. Experiments were conducted to investigate the use of phone boundary models, the use of precise phonetic segmentation for...

Conference Paper May 1, 2013

Using multiple versions of speech input in phone recognition

This study investigates the use of multiple versions of the same speech unit in automatic phone recognition. Two methods were applied to combine multiple utterance versions in decoding: cross forced-alignment...

Conference Paper May 1, 2013

Articulatory trajectories for large-vocabulary speech recognition

Studies have demonstrated that articulatory information can model speech variability effectively and can potentially help to improve speech recognition performance. Most of the studies involving articulatory information have focused on...

Conference Paper June 1, 2012

Effects of audio and ASR quality on cepstral and high-level speaker verification systems

Speech data for NIST speaker recognition evaluations has traditionally been distributed in compressed, telephone quality form, even for microphone data that was originally recorded at higher quality. We evaluate the...

Conference Paper January 1, 2012

Speaker recognition with region-constrained MLLR transforms

It has been shown that standard cepstral speaker recognition models can be enhanced by region-constrained models, where features are extracted only from certain speech regions defined by linguistic or...

Conference Paper December 1, 2011

SRILM at sixteen: Update and outlook

We review developments in the SRI Language Modeling Toolkit (SRILM) since 2002, when a previous paper on SRILM was published. These developments include measures to make training from large data...

Conference Paper August 1, 2011

Constrained cepstral speaker recognition using matched UBM and JFA training

We study constrained speaker recognition systems, or systems that model standard cepstral features that fall within particular types of speech regions. A question in modeling such systems is whether to...

Conference Paper August 1, 2011

Effective Arabic dialect classification using diverse phonotactic models

We study the effectiveness of recently developed language recognition techniques based on speech recognition models for the discrimination of Arabic dialects. Specifically, we investigate dialect-specific and cross-dialectal phonotactic models, using...

Conference Paper May 1, 2011

Making the most from multiple microphones in meeting recognition

The use of multiple distant microphones has been widely studied for meeting recognition. The two most widely used approaches are 1) combination at the signal level, via blind beamforming, followed by...

Conference Paper May 1, 2011

Bird species recognition combining acoustic and sequence modeling

The goal of this work was to explore modeling techniques to improve bird species classification from audio samples. We first developed an unsupervised approach to obtain approximate note models from...

