September 1, 2016

The 2016 Speakers in the Wild Speaker Recognition Evaluation

Citation

M. McLaren, L. Ferrer, D. Castan and A. Lawson, “The 2016 Speakers in the Wild Speaker Recognition Evaluation,” in Proc. INTERSPEECH 2016, pp. 823-827, September 2016.

Abstract

The newly collected Speakers in the Wild (SITW) database was central to a text-independent speaker recognition challenge held as part of a special session at Interspeech 2016. The SITW database is composed of audio recordings from 299 speakers collected from open source media, with an average of 8 sessions per speaker. The recordings contain unconstrained or “wild” acoustic conditions, rarely found in large speaker recognition datasets, and multi-speaker recordings for both speaker enrollment and verification. This article provides details of the SITW speaker recognition challenge and analysis of evaluation results. There were 25 international teams involved in the challenge of which 11 teams participated in an evaluation track. Teams were tasked with applying existing and novel speaker recognition algorithms to the challenges associated with the real world conditions of SITW. We provide an analysis of some of the top performing systems submitted during the evaluation and provide future research directions.

↓ Download

The 2016 Speakers in the Wild Speaker Recognition Evaluation

Abstract

Read more from SRI

Researchers develop materials that can take on the toughest conditions

Podcast: Re-imagining instructional quality and coaching

SRI’s Genome Explorer: Enhanced genome browser delivers better user experience