Automatic detection of speaker attributes based on utterance text

Citation

W. Wang, A. Kathol, and H. Bratt, “Automatic detection of speaker attributes based on utterance text,” in Proc. Interspeech, 2011, pp. 2361–2364.

Abstract

In this paper, we present models for detecting various attributes of a speaker based on uttered text alone. These attributes include whether the speaker is speaking his/her native language, the speaker’s age and gender, and the regional information reported by the speakers.  We explore various lexical features as well as features inspired by Linguistic Inquiry and Word Count and Dictionary of Affect in Language.  Overall, results suggest that when audio data is not available, by exploring effective feature sets only from uttered text and system combinations of multiple classification algorithms, we can build high quality statistical models to detect these attributes of speakers, comparable to systems that can exploit the audio data.

Index Terms: speaker attributes, machine learning, nativeness, gender, age, region


Read more from SRI

  • surgeons around a surgical robot

    The SRI research behind today’s surgical robotics

    Intuitive’s da Vinci 5 system represents a major leap in robotic-assisted medicine. It all started at SRI, which continues to advance teleoperation technologies.

  • a collage of digital graphs

    A banner year for quantum

    SRI-managed QED-C’s annual report on quantum trends captures an industry accelerating rapidly from technical promise toward major global impact.

  • ICE Cube containing SRI’s aerogel experiment, photographed prior to launch. Source: Aerospace Applications North America

    An SRI carbon capture experiment launches into space

    By synthesizing carbon-absorbing aerogels in microgravity, SRI research will give us a rare glimpse into how these materials could be radically improved.