Author: Dimitra Vergyri

April 17, 2020

Speech‐based markers for post traumatic stress disorder in US veterans

This study demonstrates that a speech-based algorithm can objectively differentiate PTSD cases from controls.
December 1, 2017

Tackling Unseen Acoustic Conditions in Query-by-Example Search Using Time and Frequency Convolution for Multilingual Deep Bottleneck Features

This paper revisits two neural network architectures developed for noise and channel robust ASR, and applies them to building a state-of-art multilingual QbE system.
March 1, 2017

Toward human-assisted lexical unit discovery without text resources

This work addresses lexical unit discovery for languages without (usable) written resources.
March 1, 2017

Speech recognition in unseen and noisy channel conditions

This work investigates robust features, feature-space maximum likelihood linear regression (fMLLR) transform, and deep convolutional nets to address the problem of unseen channel and noise conditions in speech recognition.
March 1, 2017

Joint modeling of articulatory and acoustic spaces for continuous speech recognition tasks

This paper investigates using deep neural networks (DNN) and convolutional neural networks (CNNs) for mapping speech data into its corresponding articulatory space.
September 1, 2016

Unsupervised Learning of Acoustic Units Using Autoencoders and Kohonen Nets

This work investigates learning acoustic units in an unsupervised manner from real-world speech data by using a cascade of an autoencoder and a Kohonen net.
September 1, 2016

Fusion Strategies for Robust Speech Recognition and Keyword Spotting for Channel- and Noise-Degraded Speech

Current state-of-the-art automatic speech recognition systems are sensitive to changing acoustic conditions, which can cause significant performance degradation.
December 1, 2015

Improving robustness against reverberation for automatic speech recognition

In this work, we explore the role of robust acoustic features motivated by human speech perception studies, for building ASR systems robust to reverberation effects.
September 1, 2015

Speech-based assessment of PTSD in a military population using diverse feature classes

We analyzed recordings of the Clinician-Administered PTSD Scale (CAPS) interview from military personnel diagnosed as PTSD positive versus negative.
April 1, 2015

Cross-corpus depression prediction from speech

We study a new corpus of patient-clinician interactions recorded when patients are admitted to a hospital for suicide risk and again when they are released.
November 1, 2014

The SRI AVEC-2014 Evaluation System

We explore a diverse set of features based only on spoken audio to understand which features correlate with self-reported depression scores according to the Beck depression rating scale.
September 1, 2014

Recent Improvements in SRI’s Keyword Detection System for Noisy Audio

We present improvements to a keyword spotting (KWS) system that operates in highly adverse channel conditions with very low signal-to-noise ratio levels.