June 1, 2018

Approaches to multi-domain language recognition

Citation

M, McLaren, M. Kumar Nandwana, D. Castan and L. Ferrer. Approaches to multi-domain language recognition. Speaker Odyssey 2018. Forthcoming June 2018.

Abstract

Multi-domain language recognition involves the application of a language identification (LID) system to identify languages in more than one domain. This problem was the focus of the recent NIST LRE 2017, and this article presents the findings from the SRI team during system development for the evaluation. Approaches found to provide robustness in multi-domain LID include a domain-and-language-weighted Gaussian backend classifier, duration-aware calibration, and a source normalized multi-resolution neural network backend. The recently developed speaker embeddings technology is also applied to the task of language recognition, showing great potential for future LID research.

↓ Download

Approaches to multi-domain language recognition

Abstract

Read more from SRI

Researchers develop materials that can take on the toughest conditions

Podcast: Re-imagining instructional quality and coaching

SRI’s Genome Explorer: Enhanced genome browser delivers better user experience