Automatic Dialog Act Labeling With Minimal Supervision

Citation

Venkataraman, A., Stolcke, A., & Shriberg, E. (2002). Automatic dialog act labeling with minimal supervision.

Abstract

For many natural language applications it is desirable to be able to automatically tag utterances according to their discourse function (dialog act), such as statement, question or acknowledgment. We investigate the problem of automatically tagging dialog acts when hand labeled training data is scarce. The tagging paradigm employed is a hidden Markov model in which dialog acts are states and utterances are observations, with N-gram language models as observation models. We show that bootstrapping from a small hand-labeled training set, combined with iterative relabeling of a larger unlabeled data set, is an effective approach for preserving accuracy under conditions of limited hand-labeled training data. The dialog act grammar that models the sequencing of dialog acts is found to be of paramount importance in this approach. We analyze the effect that lack of training data has on different dialog act types, and discuss implications for efficient data annotation.


Read more from SRI

  • surgeons around a surgical robot

    The SRI research behind today’s surgical robotics

    Intuitive’s da Vinci 5 system represents a major leap in robotic-assisted medicine. It all started at SRI, which continues to advance teleoperation technologies.

  • a collage of digital graphs

    A banner year for quantum

    SRI-managed QED-C’s annual report on quantum trends captures an industry accelerating rapidly from technical promise toward major global impact.

  • ICE Cube containing SRI’s aerogel experiment, photographed prior to launch. Source: Aerospace Applications North America

    An SRI carbon capture experiment launches into space

    By synthesizing carbon-absorbing aerogels in microgravity, SRI research will give us a rare glimpse into how these materials could be radically improved.