Analysis of Complementary Information Sources in the Speaker Embeddings Framework

,

Citation

M. Kumar Nandwana, M. McLaren, D. Castan, Julien van Hout and A. Lawson, “Analysis of complementary information sources in the speaker embeddings framework,” Interspeech 2018, Hyderabad, Telangana, India. Forthcoming September 2018.

Abstract

Deep neural network (DNN)-based speaker embeddings have resulted in new, state-of-the-art text-independent speaker recognition technology. However, very limited effort has been made to understand DNN speaker embeddings. In this study, our aim is analyzing the behavior of the speaker recognition systems based on speaker embeddings toward different front-end features, including the standard Mel frequency cepstral coefficients (MFCC), as well as power normalized cepstral coefficients (PNCC), and perceptual linear prediction (PLP). Using a speaker recognition system based on DNN speaker embeddings and probabilistic linear discriminant analysis (PLDA), we compared different approaches to leveraging complementary information using score-, embeddings-, and feature-level combination. We report our results for Speakers in the Wild (SITW) and NIST SRE 2016 datasets. We found that first and second embeddings layers are complementary in nature. By applying score and embedding-level fusion we demonstrate relative improvements in equal error rate of 17% on NIST SRE 2016 and 10% on SITW over the baseline system.


Read more from SRI

  • Collage of Douglas Engelbart at the Mother of All Demos and a modern computer mouse

    Stanford celebrates a world-changing SRI invention

    Spotlighting Douglas Engelbart’s invention of the computer mouse, Stanford Magazine revisits a moment when SRI transformed computing forever.

  • Two IT professionals solving a problem

    Why quantum assurance matters

    New SRI research seeks to secure the future of quantum innovation by extending software assurance capabilities from classical computers to quantum information systems.

  • PARC Forum Participants

    PARC Forum: The future of defense technologies

    Silicon Valley is paying close attention to the defense sector. SRI convened a conversation exploring new opportunities to advance security through innovation.