Connectionist Speaker Normalization and Adaptation

Citation

Abrash, V., Franco, H., Sankar, A., & Cohen, M. (1995). Connectionist speaker normalization and adaptation. In Eurospeech.

Abstract

In speaker-independent, large-vocabulary continuous speech recognition systems, recognition accuracy varies considerably from speaker to speaker, and performance may be significantly degraded for outlier speakers such as nonnative talkers. In this paper, we explore supervised speaker adaptation and normalization in the MLP component of a hybrid hidden Markov model/multilayer perceptron version of SRI’s DECIPHER™ speech recognition system. Normalization is implemented through an additional transformation network that preprocesses the cepstral input to the MLP. Adaptation is accomplished through incremental retraining of the MLP weights on adaptation data. Our approach combines adaptation and normalization in a single, consistent manner, works with limited adaptation data, and is text-independent. We show significant improvement in recognition accuracy.
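
The normalization idea in the abstract can be illustrated with a small NumPy sketch. It is not the paper's implementation: the abstract does not specify the transformation network's architecture, so this assumes a single affine layer initialized to the identity and trained by backpropagating a cross-entropy loss through a frozen MLP. The names `FrozenMLP`, `train_normalizer`, and `demo`, the layer sizes, the learning rate, and the synthetic "new speaker" data are all hypothetical.

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

class FrozenMLP:
    """Stand-in for a speaker-independent MLP (random weights, assumption)."""
    def __init__(self, n_in, n_hidden, n_out, rng):
        self.W1 = rng.normal(0.0, 0.3, (n_in, n_hidden))
        self.b1 = np.zeros(n_hidden)
        self.W2 = rng.normal(0.0, 0.3, (n_hidden, n_out))
        self.b2 = np.zeros(n_out)

    def forward(self, x):
        h = np.tanh(x @ self.W1 + self.b1)
        return h, softmax(h @ self.W2 + self.b2)

def cross_entropy(p, labels):
    return -np.mean(np.log(p[np.arange(len(labels)), labels] + 1e-12))

def train_normalizer(mlp, x, labels, lr=0.02, steps=300):
    """Fit an affine input transform z = x @ A + c; the MLP stays fixed."""
    n_in = x.shape[1]
    A = np.eye(n_in)      # identity start: initially the features pass through
    c = np.zeros(n_in)
    for _ in range(steps):
        z = x @ A + c
        h, p = mlp.forward(z)
        # Gradient of the mean cross-entropy with respect to the logits.
        d_logits = p.copy()
        d_logits[np.arange(len(labels)), labels] -= 1.0
        d_logits /= len(labels)
        # Backpropagate through the frozen MLP down to its input.
        d_h = (d_logits @ mlp.W2.T) * (1.0 - h ** 2)
        d_z = d_h @ mlp.W1.T
        # Only the transformation parameters are updated.
        A -= lr * (x.T @ d_z)
        c -= lr * d_z.sum(axis=0)
    return A, c

def demo(seed=0):
    """Synthetic check: a 'new speaker' is an affine distortion of clean features."""
    rng = np.random.default_rng(seed)
    mlp = FrozenMLP(n_in=4, n_hidden=8, n_out=3, rng=rng)
    x_clean = rng.normal(size=(200, 4))
    labels = mlp.forward(x_clean)[1].argmax(axis=1)   # supervised targets
    x_spk = 1.5 * x_clean + 0.5                       # hypothetical speaker mismatch
    before = cross_entropy(mlp.forward(x_spk)[1], labels)
    A, c = train_normalizer(mlp, x_spk, labels)
    after = cross_entropy(mlp.forward(x_spk @ A + c)[1], labels)
    return before, after
```

Calling `demo()` returns the cross-entropy on the distorted features before and after the transform is trained. The adaptation step described in the abstract would, under the same assumptions, continue gradient updates on the MLP weights themselves rather than keeping them frozen.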

