• Skip to primary navigation
  • Skip to main content
SRI logo
  • SRI Japan
  • プレスルーム
  • NSIC
  • English 英語
Search
Close
カテゴリーなし March 30, 2012

IVECTOR-BASED PROSODIC SYSTEM FOR LANGUAGE IDENTIFICATION

Citation

Copy to clipboard


D. Martínez, L. Burget, L. Ferrer and N. Scheffer, “iVector-based prosodic system for language identification,” 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012, pp. 4861-4864, doi: 10.1109/ICASSP.2012.6289008.

Abstract

Prosody is the part of speech where rhythm, stress, and intonation are reflected. In language identification tasks, these
characteristics are assumed to be language dependent, and
thus the language can be identified from them. In this paper, an automatic language recognition system that extracts
prosody information from speech and makes decisions about
the language with a generative classifier based on iVectors is
built. The system is tested on the NIST LRE09 dataset. The
results are still not comparable to state-of-the-art acoustic and
phonotactic systems. However, they are promising and the fusion of the new approach with an iVector-based acoustic system is found to bring further improvements over the latter.

↓ Download PDF

Share this

お問い合わせ

送信ボタンを押すと…

お問い合わせに最も適切にお答えできる人物にマッチングします。48時間以内に返信いたします。

弊社のプライバシーポリシー

SRI Japan

日本支社ニュースレター

SRI Japan/日本支社から毎月ニュースレターを配信しています。人工知能、ロボティクス、バイオサイエンスなどのSRIインターナショナルの最新技術ブレークスルーに関する記事をお読みください。

ENGLISH 英語
SRI International
  • Privacy Policy
  • Cookies
  • DMCA
  • Copyright © 2021 SRI International