June 1, 2018

Approaches to multi-domain language recognition

Citation

M, McLaren, M. Kumar Nandwana, D. Castan and L. Ferrer. Approaches to multi-domain language recognition. Speaker Odyssey 2018. Forthcoming June 2018.

Abstract

Multi-domain language recognition involves the application of a language identification (LID) system to identify languages in more than one domain. This problem was the focus of the recent NIST LRE 2017, and this article presents the findings from the SRI team during system development for the evaluation. Approaches found to provide robustness in multi-domain LID include a domain-and-language-weighted Gaussian backend classifier, duration-aware calibration, and a source normalized multi-resolution neural network backend. The recently developed speaker embeddings technology is also applied to the task of language recognition, showing great potential for future LID research.

↓ Download

Approaches to multi-domain language recognition

Abstract

Read more from SRI

Podcast: How students can help drive educational innovation

SRI researchers develop rugged, low-cost, drifting sensors to learn more about the oceans

An SRI collaboration aims to improve online education for college students