K. Laskowski and E. Shriberg, “Modeling other talkers for improved dialog act recognition in meetings,” in Proc. 10th Annual Conference of the International Speech Communication Association 2009 (INTERSPEECH 2009), pp. 2783–2786.
Automatic dialog act (DA) modeling has been shown to benefit meeting understanding, but current approaches to DA recognition tend to suffer from a common problem: they under-represent behaviors found at turn edges, during which the “floor”is negotiated among meeting participants. We propose a new approach that takes into account speech from other talkers, relying only on speech/non-speech information from all participants. We find (1) that modeling other participants improves DA detection, even in the absence of other information, (2) that only the single locally most talkative other participant matters, and (3) that 10 seconds provides a sufficiently large local context. Results further show significant performance improvements over a lexical-only system particularly for the DAs of interest. We conclude that interaction-based modeling at turn edges can be achieved by relatively simple features and should be incorporated for improved meeting understanding.
Index Terms: vocal interaction, cross-speaker modeling, speech/non-speech, dialog acts, meetings