Skip to content

Research
Commercialization
About
- People
- News & Stories
- Events
- Our History
- Contact
Innovating In
Careers
日本支社

Research
Commercialization
About
- People
- News & Stories
- Events
- Our History
- Contact
Innovating In
Careers
日本支社

Search sri.com

June 1, 2018

Voices Obscured in Complex Environmental Settings (VOiCES) corpus

Colleen Richey, Horacio Franco, Aaron Lawson, Allen Stauffer

Citation

C. Richey, M. A. Barrios, Z. Armstrong, C. Bartels, H. Franco, M. Graciarena, A. Lawson, M. K. Nandwana, A. Stauffer, J. van Hout, P. Gamble, J. Hetherly, C. Stephenson and K. Ni, “Voices obscured in complex environmental settings (VOiCES) corpus,” Interspeech 2018, Hyderabad, Telangana, India. Forthcoming September 2018.

Abstract

This paper introduces the Voices Obscured in Complex Environmental Settings (VOiCES) corpus, a freely available dataset under Creative Commons BY 4.0. This dataset will promote speech and signal processing research of speech recorded by far-field microphones in noisy room conditions. Publicly available speech corpora are mostly composed of isolated speech at close-range microphony. A typical approach to better represent realistic scenarios, is to convolve clean speech with noise and simulated room response for model training. Despite these efforts, model performance degrades when tested against uncurated speech in natural conditions. For this corpus, audio was recorded in furnished rooms with background noise played in conjunction with foreground speech selected from the LibriSpeech corpus. Multiple sessions were recorded in each room to accommodate for all foreground speech-background noise combinations. Audio was recorded using twelve microphones placed throughout the room, resulting in 120 hours of audio per microphone. This work is a multi-organizational effort led by SRI International and Lab41 with the intent to push forward state-of-the-art distant microphone approaches in signal processing and speech recognition.

↓ Review online

Read more from SRI

July 23, 2026

SRI-backed Valence AI raises $5M to integrate emotional intelligence into the trust stack

Emotional inference is becoming a new layer in digital identity and fraud-prevention systems.
July 16, 2026

Tackling quantum scalability with NIST-backed QMEC

SRI-led QMEC will find opportunities and identify gaps within the quantum supply chain.
July 14, 2026

A thousand qubits in bloom, now let’s scale

Standardization is the way to the first quantum computer.

Join Our Team

Build your own legacy

Explore careers

Hire Us

Solutions to your most complex challenges

Send an inquiry

Contact Us

General inquiries

Get the latest news from SRI

Commercialization

Media Inquiries

333 Ravenswood Ave
Menlo Park, CA 94025 USA

+1 (650) 859-2000

© 2026 SRI INTERNATIONAL

Manage Cookie Consent

To provide the best experiences, we use technologies like cookies to store and/or access device information. Consenting to these technologies will allow us to process data such as browsing behavior or unique IDs on this site. Not consenting or withdrawing consent, may adversely affect certain features and functions.

Functional Functional Always active

The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.

Preferences Preferences

The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.

Statistics Statistics

The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.

Marketing Marketing

The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.

Manage options
Manage services
Manage {vendor_count} vendors
Read more about these purposes

View preferences

{title}
{title}
{title}