R. Hadsell, Z. Kira, W. Wang, and K. Precoda, “Unsupervised topic modeling for leader detection in spoken discourse,” in Proc. 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2012), pp. 5113–5116.
In this paper, we describe a method for leader detection in multiparty spoken discourse that relies on unsupervised topic modeling to segment the discourse automatically. Latent Dirichlet allocation is applied to sliding temporal windows of utterances, resulting in a topic model which captures the fluid transitions from topic to topic which occur in multi-party discourse. Further processing discretizes the continuous topic mixtures into sequential topic segments. Features are extracted from topic shift regions and used to train a binary role classifier. The added topic shift features significantly improve the baseline performance on two corpora, demonstrating both the value of the features and the robustness of the unsupervised segmentation. Furthermore, our classification results on the ICSI corpus, using automatically segmented topics, are better than the results using ground truth segmentations.
Index Terms— Social role classification, leader detection, speaker role labeling, latent Dirichlet allocation, topic shift analysis