ASR Error Detection Using Recurrent Neural Network Language Model and Complementary ASR

Citation

Y. Tam, Y. Lei, J. Zheng and W. Wang, “ASR error detection using recurrent neural network language model and complementary ASR,” 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2014, pp. 2312-2316, doi: 10.1109/ICASSP.2014.6854012.

Abstract

Detecting automatic speech recognition (ASR) errors can play an important role in effective human-computer spoken dialogue systems, as recognition errors can hinder accurate system understanding of user intents. Our goal is to locate errors in an utterance so that the dialogue manager can pose appropriate clarification questions to the user. We propose two approaches to improve ASR error detection: (1) using recurrent neural network language models to capture long-distance word context within the current utterance and across previous utterances; (2) using a complementary ASR system. The intuition is that when two complementary ASR systems disagree on a region in an utterance, this region is most likely an error. We train a neural network predictor of errors using a variety of features. We performed experiments on both English and Iraqi Arabic ASR and observed significant improvement in error detection using the proposed methods.

Index Terms— ASR error detection, recurrent neural network language model, deep neural network acoustic model, complementary ASR
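
The disagreement intuition in the abstract can be illustrated with a minimal sketch. This is not the authors' method: the paper trains a neural network predictor over a variety of features, whereas the code below only derives one possible per-word disagreement feature by aligning two 1-best hypotheses with Python's standard `difflib`. The function name `flag_disagreements` and the example utterances are hypothetical.

```python
from difflib import SequenceMatcher


def flag_disagreements(primary, secondary):
    """Return one boolean per word in `primary`.

    True means the complementary hypothesis `secondary` does not agree
    with that word after alignment (hypothetical feature, for illustration).
    """
    flags = [False] * len(primary)
    # autojunk=False disables difflib's popularity heuristic, which is
    # meant for long sequences and is not wanted for short utterances.
    matcher = SequenceMatcher(a=primary, b=secondary, autojunk=False)
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        # "replace": the systems output different words for this region.
        # "delete": the primary system has words the secondary lacks.
        if tag in ("replace", "delete"):
            for i in range(i1, i2):
                flags[i] = True
        # "insert" regions contain no primary words, so there is nothing
        # to flag on the primary side; this sketch simply skips them.
    return flags


primary_hyp = "i want to fly to boston".split()
secondary_hyp = "i want to fly to austin".split()
flags = flag_disagreements(primary_hyp, secondary_hyp)
suspect_words = [w for w, bad in zip(primary_hyp, flags) if bad]
```

In a full detector, a flag like this would presumably be one input among many to the error predictor, alongside features such as language model scores, rather than a decision on its own.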

