Advances in Mandarin Broadcast Speech Recognition

Citation

Hwang, M. Y., Wang, W., Lei, X., Zheng, J., Cetin, O., & Peng, G. (2007). Advances in Mandarin broadcast speech recognition. In Eighth Annual Conference of the International Speech Communication Association.

Abstract

We describe our continuing efforts to improve the UW-SRI-ICSI Mandarin broadcast speech recognizer. These include increasing the acoustic and text training data, adding discriminative features, incorporating a frame-level discriminative training criterion, multiple-pass acoustic model (AM) cross adaptation, language model (LM) genre adaptation, and system combination. The net effect without LM adaptation was a 24-64% relative reduction in character error rate (CER) across a variety of test sets. In addition, LM adaptation gave a further 6% relative CER reduction on broadcast conversations.
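
For readers unfamiliar with the metric, the following is a minimal sketch of how a character error rate and a relative reduction between two systems are commonly computed. It is not the scoring pipeline used in the paper. The function names and the example strings and rates are hypothetical and chosen only for illustration.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (here, strings of characters)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion of a reference character
                            curr[j - 1] + 1,      # insertion of a hypothesis character
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(ref, hyp):
    """Character error rate: edit distance divided by reference length."""
    if len(ref) == 0:
        raise ValueError("reference must be non-empty")
    return edit_distance(ref, hyp) / len(ref)


def relative_reduction(baseline, improved):
    """Relative error reduction of `improved` over `baseline` (both are rates)."""
    return (baseline - improved) / baseline


# Hypothetical example: one substituted character out of six.
example_cer = cer("今天天气很好", "今天天汽很好")

# Hypothetical rates: a drop from 20% CER to 15% CER is a 25% relative reduction.
example_rel = relative_reduction(0.20, 0.15)
```

A relative reduction is measured against the baseline error rate, so the same percentage can correspond to very different absolute gains depending on how high the baseline CER was on a given test set.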

