Speech & natural language publications
-
Comparison of Neutralizing Abilities of Human Monoclonal Antibodies Binding Different Epitopes on Botulinum Neurotoxin A
-
Improving Language Identification Robustness to Highly Channel-Degraded Speech through Multiple System Fusion
We describe a language identification system developed for robustess to noise conditions such as those encountered under the DARPA RATS program, which is focused on multi-channel audio collected in high…
-
Modulation features for noise robust speaker identification
In this paper, we present a robust acoustic feature on top of robust modeling techniques to further improve speaker identification performance.
-
Strategies for high accuracy keyword detection in noisy channels
We present design strategies for a keyword spotting (KWS) system that operates in highly degraded channel conditions with very low signal-to-noise ratio levels.
-
A Noise-Robust System for NIST 2012 Speaker Recognition Evaluation
This paper presents SRI’s submission along with a careful analysis of the approaches that provided gains for this challenging evaluation including a multiclass voice-activity detection system, the use of noisy…
-
All for one: Feature combination for highly channel-degraded speech activity detection
This paper presents a feature combination approach to improve SAD on highly channel degraded speech as part of the Defense Advanced Research Projects Agency’s (DARPA) Robust Automatic Transcription of Speech…
-
Damped oscillator cepstral coefficients for robust speech recognition
This paper presents a new signal-processing technique motivated by the physiology of human auditory system.
-
Adaptive Gaussian Backend for Robust Language Identification
This paper proposes adaptive Gaussian backend (AGB), a novel approach to robust language identification (LID).
-
Rich system combination for keyword spotting in noisy and acoustically heterogeneous audio streams
We address the problem of retrieving spoken information from noisy and heterogeneous audio archives using a rich system combination with a diverse set of noise-robust modules and audio characterization.
-
Articulatory trajectories for large-vocabulary speech recognition
We present a neural network model to estimate articulatory trajectories from speech signals where the model was trained using synthetic speech signals generated by Haskins Laboratories’ task-dynamic model of speech…
-
A noise robust i-vector extractor using vector taylor series for speaker recognition
We propose a novel approach for noise-robust speaker recognition, where the model of distortions caused by additive and convolutive noises is integrated into the i-vector extraction framework.
-
Fusion of acoustic, perceptual and production features for robust speech recognition non-stationary noise
This paper shows that fusion of multiple noise robust feature streams motivated by speech production and perception theories help to significantly improve the robustness of traditional speech recognition systems.