Advances in Mandarin Broadcast Speech Recognition

Citation

Hwang, M. Y., Wang, W., Lei, X., Zheng, J., Cetin, O., & Peng, G. (2007). Advances in Mandarin broadcast speech recognition. In Eighth Annual Conference of the International Speech Communication Association.

Abstract

We describe our continuing efforts to improve the UW-SRI-ICSI Mandarin broadcast speech recognizer. This includes increasing acoustic and text training data, adding discriminative features, incorporating frame-level discriminative training criterion, multiplepass acoustic model (AM) cross adaptation, language model (LM) genre adaptation and system combination. The net effect without LM adaptation was a 24-64 pct. relative reduction in character error rates (CERs) on a variety of test sets. In addition, LM adaptation gave us another 6 pct. of relative CER reduction on broadcast conversations.


Read more from SRI

  • surgeons around a surgical robot

    The SRI research behind today’s surgical robotics

    Intuitive’s da Vinci 5 system represents a major leap in robotic-assisted medicine. It all started at SRI, which continues to advance teleoperation technologies.

  • a collage of digital graphs

    A banner year for quantum

    SRI-managed QED-C’s annual report on quantum trends captures an industry accelerating rapidly from technical promise toward major global impact.

  • ICE Cube containing SRI’s aerogel experiment, photographed prior to launch. Source: Aerospace Applications North America

    An SRI carbon capture experiment launches into space

    By synthesizing carbon-absorbing aerogels in microgravity, SRI research will give us a rare glimpse into how these materials could be radically improved.