Integrating Neural Networks into Computer Speech Recognition Systems


Citation

Cohen, M., Franco, H., Morgan, N., Rumelhart, D., Abrash, V., & Konig, Y. Integrating Neural Networks Into Computer Speech Recognition Systems.

Introduction

Most current state-of-the-art continuous-speech recognition systems are based on hidden Markov modeling techniques. The work described here integrated neural networks into a state-of-the-art continuous-speech recognition system based on hidden Markov models (HMMs), improving recognition accuracy and reducing model complexity. HMMs may be thought of as doubly stochastic finite state machines, consisting of a set of states, transition probabilities between states, and a probability distribution over output symbols associated with each state. When used to model speech, the output symbols represent acoustic observations, and the states model subphonetic acoustic events (e.g., closures, bursts, transitions). Current HMM-based speech recognition systems typically model phonetic units, or “phones” (e.g., the sound “m” in the word “map”), with a sequence of such states. Sequences of phone models can be concatenated to form word models. Word models can in turn be connected according to grammatical constraints, forming large networks that model any allowable sentence within an application. This approach allows a hierarchy of levels of linguistic description to be encoded within a uniform mathematical framework.
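The HMM structure described above can be illustrated with a minimal sketch. It is not the system from the paper: it assumes discrete output symbols, a strictly left-to-right topology in which each state either repeats or advances to the next, and made-up probabilities. The names `PhoneHMM`, `concatenate`, and `likelihood`, and the symbol labels such as "nasal", are hypothetical and chosen only for illustration. The sketch shows phone models chained into a word model for "map" and the forward algorithm scoring a non-empty observation sequence against it.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class PhoneHMM:
    """Left-to-right HMM for one phone (all numbers are illustrative)."""
    name: str
    self_loop: List[float]             # P(stay in state i); 1 - self_loop[i] is P(advance)
    emissions: List[Dict[str, float]]  # per-state distribution over discrete output symbols


def concatenate(phones: List[PhoneHMM]) -> PhoneHMM:
    """Form a word model by joining the state sequences of its phone models."""
    self_loop = [p for ph in phones for p in ph.self_loop]
    emissions = [e for ph in phones for e in ph.emissions]
    return PhoneHMM("+".join(ph.name for ph in phones), self_loop, emissions)


def likelihood(model: PhoneHMM, observations: List[str]) -> float:
    """Forward algorithm: P(observations | model).

    Assumes a non-empty sequence that starts in the first state and
    exits from the last state after the final observation.
    """
    n = len(model.self_loop)
    alpha = [0.0] * n
    alpha[0] = model.emissions[0].get(observations[0], 0.0)
    for obs in observations[1:]:
        new = [0.0] * n
        for j in range(n):
            stay = alpha[j] * model.self_loop[j]
            enter = alpha[j - 1] * (1.0 - model.self_loop[j - 1]) if j > 0 else 0.0
            new[j] = (stay + enter) * model.emissions[j].get(obs, 0.0)
        alpha = new
    return alpha[-1] * (1.0 - model.self_loop[-1])


# Hypothetical one-state phone models; real systems use several states per phone.
m = PhoneHMM("m", [0.6], [{"nasal": 0.9, "vowel": 0.1}])
ae = PhoneHMM("ae", [0.7], [{"vowel": 0.9, "nasal": 0.1}])
p = PhoneHMM("p", [0.5], [{"closure": 0.6, "burst": 0.4}])

word_map = concatenate([m, ae, p])
score = likelihood(word_map, ["nasal", "vowel", "vowel", "closure", "burst"])
print(word_map.name, score)
```

Connecting word models according to a grammar follows the same idea, with transitions between the final state of one word and the initial states of the words allowed to follow it; that step is omitted here for brevity.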

