Constrained cepstral speaker recognition using matched UBM and JFA training

Citation

M. H. Sanchez, L. Ferrer, E. Shriberg, and A. Stolcke, “Constrained cepstral speaker recognition using matched UBM and JFA training,” in Proc. Interspeech, 2011, pp. 737–740.

Abstract

We study constrained speaker recognition systems, or systems that model standard cepstral features that fall within particular types of speech regions.  A question in modeling such systems is whether to constrain universal background model (UBM) training, joint factor analysis (JFA), or both.  We explore this question, as well as how to optimize UBM model size, using a corpus of Arabic male speakers.  Over a large set of phonetic and prosodic constraints, we find that the performance of a system using constrained JFA and UBM is on average 5.24% better than when using constraint-independent (all frames) JFA and UBM.  We find further improvement from optimizing UBM size based on the percentage of frames covered by the constraint.

Index Terms: Speaker Recognition, Cepstral Features, Constraints, Joint Factor Analysis


Read more from SRI

  • surgeons around a surgical robot

    The SRI research behind today’s surgical robotics

    Intuitive’s da Vinci 5 system represents a major leap in robotic-assisted medicine. It all started at SRI, which continues to advance teleoperation technologies.

  • a collage of digital graphs

    A banner year for quantum

    SRI-managed QED-C’s annual report on quantum trends captures an industry accelerating rapidly from technical promise toward major global impact.

  • ICE Cube containing SRI’s aerogel experiment, photographed prior to launch. Source: Aerospace Applications North America

    An SRI carbon capture experiment launches into space

    By synthesizing carbon-absorbing aerogels in microgravity, SRI research will give us a rare glimpse into how these materials could be radically improved.