Detection of Social Roles in Conversations using Dynamic Bayesian Networks

Citation

S. Yaman, D. Hakkani-Tur and G. Tur, “Detection of social roles in conversation using dynamic bayesian networks,” in Proc. 11th Annual Conference of the International Speech Communication Association 2010 (INTERSPEECH 2010), pp. 2870–2873.

Abstract

In this paper, we focus on inferring social roles in conversations using information extracted only from the speaking styles of the speakers. We use dynamic Bayesian networks (DBNs) to model the turn-taking behavior of the speakers. DBNs provide the capability of naturally formulating the dependencies between random variables. Specifically, we first model our problem as a hidden Markov model (HMM). As it turns out, the knowledge of the segments that belong to the same speaker can be augmented into this HMM structure to form a DBN. This information places a constraint on two subsequent speaker roles such that the current speaker role depends not only on the previous speaker’s role but also on that most recent role assigned to the same speaker. We conducted an experimental study to compare these two modeling approaches using broadcast shows. In our experiments, the approach with the constraint on same speaker segments assigned 89.9% turns the correct role whereas the HMM-based approach assigned 79.2% of turns their correct role.

Keywords: Social role discovery, speaker turn detection, spoken language understanding



Read more from SRI

  • An arid, rural Nevada landscape

    Can AI help us find valuable minerals?

    SRI’s machine learning-based geospatial analytics platform, already adopted by the USGS, is poised to make waves in the mining industry.

  • Two students in a computer lab

    Building a lab-to-market pipeline for education

    The SRI-led LEARN Network demonstrates how we can get the best evidence-based educational programs to classrooms and students.

  • Code reflected in a man's eyeglasses

    LLM risks from A to Z

    A new paper from SRI and Brazil’s Instituto Eldorado delivers a comprehensive update on the security risks to large language models.