SRI Authors: Mitchell McLaren, Diego Castán
M. McLaren, M. Kumar Nandwana, D. Castan and L. Ferrer. Approaches to multi-domain language recognition. Speaker Odyssey 2018. Forthcoming June 2018.
Multi-domain language recognition involves applying a language identification (LID) system to identify languages in more than one domain. This problem was the focus of the recent NIST LRE 2017, and this article presents the SRI team's findings from system development for the evaluation. Approaches found to provide robustness in multi-domain LID include a domain-and-language-weighted Gaussian backend classifier, duration-aware calibration, and a source-normalized multi-resolution neural network backend. The recently developed speaker embeddings technology is also applied to the task of language recognition, showing great potential for future LID research.
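
To make the idea of a domain-and-language-weighted Gaussian backend more concrete, the following is a minimal sketch and not the system described in the paper. It assumes a shared-covariance Gaussian classifier over fixed-length utterance vectors, and it assumes a weighting scheme in which every (language, domain) pair contributes equal total mass to training. That scheme, along with the names `sample_weights`, `fit_gaussian_backend`, and `score`, is hypothetical and chosen only for illustration.

```python
import numpy as np

def sample_weights(langs, domains):
    """Assumed scheme: weight each sample by 1 / count of its (language, domain) pair,
    so that every pair carries the same total weight regardless of its size."""
    pairs = list(zip(langs, domains))
    counts = {}
    for p in pairs:
        counts[p] = counts.get(p, 0) + 1
    return np.array([1.0 / counts[p] for p in pairs])

def fit_gaussian_backend(X, langs, weights):
    """Weighted class means plus one weighted within-class covariance shared by all classes."""
    X = np.asarray(X, dtype=float)
    langs = np.asarray(langs)
    classes = sorted(set(langs.tolist()))
    dim = X.shape[1]
    means = np.zeros((len(classes), dim))
    cov = np.zeros((dim, dim))
    for k, c in enumerate(classes):
        mask = langs == c
        w = weights[mask]
        means[k] = np.average(X[mask], axis=0, weights=w)
        d = X[mask] - means[k]
        cov += (w[:, None] * d).T @ d
    cov /= weights.sum()
    cov += 1e-6 * np.eye(dim)  # small ridge to keep the covariance invertible
    return classes, means, cov

def score(X, means, cov):
    """Per-class Gaussian log-likelihoods, up to a term that is the same for every class."""
    X = np.asarray(X, dtype=float)
    P = np.linalg.inv(cov)
    linear = X @ P @ means.T
    offset = 0.5 * np.einsum("kd,de,ke->k", means, P, means)
    return linear - offset
```

In this sketch the highest-scoring column for each row of `score(...)` gives the predicted language, and the scores would still need a separate calibration step before being interpreted as log-likelihood ratios.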