Modeling Consistency in a Speaker Independent Continuous Speech Recognition System

Citation

Konig, Y., Morgan, N., Wooters, C., Abrash, V., Cohen, M., & Franco, H. (1992). Modeling consistency in a speaker independent continuous speech recognition system. Advances in Neural Information Processing Systems, 5.

Abstract

We would like to incorporate speaker-dependent consistencies, such as gender, in an otherwise speaker-independent speech recognition system. In this paper we discuss a Gender Dependent Neural Network (GDNN) which can be tuned for each gender, while sharing most of the speaker independent parameters. We use a classification network to help generate gender-dependent phonetic probabilities for a statistical (HMM) recognition system. The gender classification net predicts the gender with high accuracy, 98.3 pct on a Resource Management test set. However, the integration of the GDNN into our hybrid HMM-neural network recognizer provided an improvement in the recognition score that is not statistically significant on a Resource Management test set.


Read more from SRI

  • Collage of Douglas Engelbart at the Mother of All Demos and a modern computer mouse

    Stanford celebrates a world-changing SRI invention

    Spotlighting Douglas Engelbart’s invention of the computer mouse, Stanford Magazine revisits a moment when SRI transformed computing forever.

  • Two IT professionals solving a problem

    Why quantum assurance matters

    New SRI research seeks to secure the future of quantum innovation by extending software assurance capabilities from classical computers to quantum information systems.

  • PARC Forum Participants

    PARC Forum: The future of defense technologies

    Silicon Valley is paying close attention to the defense sector. SRI convened a conversation exploring new opportunities to advance security through innovation.