This paper assesses the role of robust acoustic features in spoken term detection (a.k.a keyword spotting—KWS) under heavily degraded channel and noise corrupted conditions.
Damped oscillator cepstral coefficients for robust speech recognition
This paper presents a new signal-processing technique motivated by the physiology of human auditory system.
Strategies for high accuracy keyword detection in noisy channels
We present design strategies for a keyword spotting (KWS) system that operates in highly degraded channel conditions with very low signal-to-noise ratio levels.
All for one: Feature combination for highly channel-degraded speech activity detection
This paper presents a feature combination approach to improve SAD on highly channel degraded speech as part of the Defense Advanced Research Projects Agency’s (DARPA) Robust Automatic Transcription of Speech (RATS) program.
Modulation features for noise robust speaker identification
In this paper, we present a robust acoustic feature on top of robust modeling techniques to further improve speaker identification performance.
Normalized amplitude modulation features for large vocabulary noise-robust speech recognition
In this work, we present an amplitude modulation feature derived from Teager’s nonlinear energy operator that is power normalized and cosine transformed to produce normalized modulation cepstral coefficient (NMCC) features…
Robust speech representation of voiced sounds based on synchrony determiniation with PLLS
We propose to include synchrony effects, known to exist in the auditory system, to represent voiced parts of the speech signal in a robust way.
EduSpeak®: A Speech Recognition and Pronunciation Scoring Toolkit for Computer-Aided Language Learning Applications
SRI International’s EduSpeak® system is a SDK that enables developers of interactive language education software to use state-of-the-art speech recognition and pronunciation scoring technology.
Recent advances in SRI’s IraqComm Iraqi Arabic-English speech-to-speech translation system
We summarize recent progress on SRI’s IraqComm™ IraqiArabic-English two-way speech-to-speech translation system.