Publications Search | Page 7 | SRI International

Toggle Menu

Publications Search

By Julien van Hout, Dimitra Vergyri, Vikramjit Mitra

State-of-the-art calibration and fusion approaches for spoken term detection (STD) systems currently rely on a multi-pass approach where the scores are calibrated, then fused, and finally re-calibrated to obtain a single decision threshold across keywords.

May, 2014
In Proceedings
418
By Elizabeth Shriberg

Recent studies have shown the importance of using online videos along with textual material in educational instruction, especially for better content retention and improved concept understanding. A key question is how to select videos to maximize student engagement, particularly when there are...

May, 2014
In Proceedings
418

This paper assesses the role of robust acoustic features in spoken term detection (a.k.a keyword spotting—KWS) under heavily degraded channel and noise corrupted conditions.

May, 2014
In Proceedings
418

We address the problem of subselecting a large set of acoustic data to train automatic speech recognition (ASR) systems.

May, 2014
In Proceedings
418

We present a system for detecting lexical stress in English words spoken by English learners.

May, 2014
In Proceedings
418

Recently, a new version of the iVector modelling has been proposed for noise robust speaker recognition, where the nonlinear function that relates clean and noisy cepstral coefficients is approximated by a first order vector Taylor series (VTS). In this paper, it is proposed to substitute the first...

May, 2014
In Proceedings
418

Though sparse features have produced significant gains over traditional dense features in statistical machine translation, careful feature selection and feature engineering are necessary to avoid overfitting in optimizations.

May, 2014
In Proceedings
418
By Wen Wang, Vikramjit Mitra

Accurate phone-level segmentation of speech remains an important task for many subfields of speech research.

May, 2014
In Proceedings
418

The state-of-the-art in example-based multimedia event detection (MED) rests on heterogeneous classifiers whose scores are typically combined in a late-fusion scheme.

May, 2014
In Proceedings
418

Recently, researchers have tackled difficult voice biometrics problems that resonate with the defense and research communities.

Nov, 2013
In Proceedings
418

Pages