J. Zheng, W. Wang and N. F. Ayan, “Development of SRI’s translation systems for broadcast news and broadcast,” in Proc. 9th Annual Conference of the International Speech Communication Association, Australia, 2008, pp. 2346–2349.
We present our recent work on developing large-vocabulary Arabic-to-English and Chinese-to-English speech-to-text translation systems for the January 2008 Global Autonomous Language Exploitation (GALE) retest evaluation. Two audio genres were involved in the evaluation: broadcast news and broadcast conversation. Our system, following the hierarchical phrase-based translation approach, has a two-pass decoding strategy, with the first-pass integrated search generating 3000 unique n-best lists, which are then reranked by several different language models in the second pass. We emphasize our work on adapting the system, which was mostly trained on text data, to the speech genres, including number tokenization, punctuation compensation, and various optimization techniques. We present our results on several different tuning and testing data sets used for system development.
Index Terms: speech-to-text translation, hierarchical-phrasebased translation, n-best reranking