Prosody-Based Automatic Detection of Annoyance and Frustration in Human-Computer Dialog

Citation

Ang, J., Dhillon, R., Krupski, A., Shriberg, E., & Stolcke, A. (2002, September). Prosody-based automatic detection of annoyance and frustration in human-computer dialog. In INTERSPEECH (pp. 2037-2040).

Abstract

We investigate the use of prosody for the detection of frustration and annoyance in natural human-computer dialog. In addition to prosodic features, we examine the contribution of language model information and speaking “style”. Results show that a prosodic model can predict whether an utterance is neutral versus “annoyed or frustrated” with an accuracy on par with that of human interlabeler agreement. Accuracy increases when discriminating only “frustrated” from other utterances, and when using only those utterances on which labelers originally agreed. Furthermore, prosodic model accuracy degrades only slightly when using recognized versus true words. Language model features, even if based on true words, are relatively poor predictors of frustration. Finally, we find that hyperarticulation is not a good predictor of emotion; the two phenomena often occur independently.


Read more from SRI

  • Banner and attendees at the IEEE Hard Tech Venture Summit

    Cultivating hard tech startups that scale

    IEEE’s Hard Tech Venture Summit convened innovators at SRI to refine strategies and build new networks.

  • Patient going into a MRI

    Bringing surgical tools inside the MRI

    Drawing on SRI’s unique innovation ecosystem, the startup Medical Devices Corner is seeking to improve cancer surgery by advancing MRI-safe teleoperation.

  • Christopher Mims and Susan Patrick

    PARC Forum: How to AI

    The Wall Street Journal tech columnist Christopher Mims and SRI Education’s Susan Patrick discuss how AI can strengthen human agency.