Combining Discriminative Re-Ranking and Co-Training for Parsing Mandarin Speech Transcripts

Citation

W. Wang, “Combining discriminative re-ranking and co-training for parsing Mandarin speech transcripts,” 2009 IEEE International Conference on Acoustics, Speech and Signal Processing, 2009, pp. 4705-4708, doi: 10.1109/ICASSP.2009.4960681.

Abstract

Discriminative re-ranking has been able to significantly improve parsing performance, and co-training has proven to be an effective weakly supervised learning algorithm to bootstrap parsers from a small in-domain seed labeled corpus using a large amount of unlabeled in-domain data. In this paper, we present systematic investigations on combining discriminative re-ranking and co-training, including co-training re-ranked parsers and co-training re-rankers. We show that combining discriminative re-ranking and co-training could improve the F-measure by 1.8%-2% absolute compared to co-training two state-of-the-art Chinese parsers without re-ranking, for parsing Mandarin broadcast news and conversation transcripts.


Read more from SRI

  • An arid, rural Nevada landscape

    Can AI help us find valuable minerals?

    SRI’s machine learning-based geospatial analytics platform, already adopted by the USGS, is poised to make waves in the mining industry.

  • Two students in a computer lab

    Building a lab-to-market pipeline for education

    The SRI-led LEARN Network demonstrates how we can get the best evidence-based educational programs to classrooms and students.

  • Code reflected in a man's eyeglasses

    LLM risks from A to Z

    A new paper from SRI and Brazil’s Instituto Eldorado delivers a comprehensive update on the security risks to large language models.