Publications
-
Adaptive Process Management: An AI Perspective
In an effort to encourage the exchange of ideas, this paper explores how techniques from intelligent reactive control might be leveraged to provide adaptivity within workflow technologies, while also acknowledging…
-
Multistrategy learning for information extraction
We describe three different multistrategy approaches. Experiments on two IE domains a collection of electronic seminar announcements from a university computer science department and a set of newswire articles describing…
-
Toward An Integration Of The Social And The Scientific; Observing, Modeling, And Promoting The Explanatory Coherence Of Reasoning
Is scientific reasoning more likely to employ formal tools and/or more likely to involve the vigilant search for disconfirmation—something that just plain folk do, but less frequently?
-
Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech?
This study asks whether current approaches, which use mainly word information, could be improved by adding prosodic information. The study is based on more than 1000 conversations from the Switchboard…
-
MVIEWS: Multimodal Tools for the Video Analyst
SRI has developed MVIEWS, a system for annotating, indexing, extracting, and disseminating information from video streams for surveillance and intelligence applications. MVIEWS is implemented within the Open Agent Architecture, a…
-
Automatic Detection of Discourse Structure for Speech Recognition and Understanding
We describe a new approach for statistical modeling and detection of discourse structure for natural conversational speech. Our model is based on 42 `Dialog Acts' (DAs), (question, answer, backchannel, agreement,…
-
Using Information Extraction to Improve Information Retrieval
The authors describe an approach to applying a particular kind of Natural Language Processing NLP system to the TREC routing task in Information Retrieval IR.
-
Automatic Pronunciation Scoring of Specific Phone Segments for Language Instruction
The aim of the work described in this paper is to develop methods for automatically assessing the pronunciation quality of specific phone segments uttered by students learning a foreign language.
-
A Prosody-Only Decision-Tree Model for Disfluency Detection
We have developed a disfluency detection method using decision tree classifiers that use only local and automatically extracted prosodic features. Because the model doesn't rely on lexical information, it is…
-
Diagrammatic Methods for Deriving and Relating Temporal Neural Network Algorithms
We present an alternative approach based on a set of simple block diagram manipulation rules. The approach provides a common framework to derive popular algorithms including backpropagation and backpropagation-through-time, without…
-
A Lognormal Tied Mixture Model of Pitch for Prosody-Based Speaker Recognition
In this work, we develop a statistical model of pitch that allows unbiased estimation of pitch statistics from pitch tracks which are subject to doubling and/or halving.
-
Modeling Linguistic Segment and Turn Boundaries for N-best Rescoring of Spontaneous Speech
We present an N-best rescoring algorithm that removes the effect of segmentation mismatch. Furthermore, we show that explicit language modeling of hidden linguistic segment boundaries is improved by including turn-boundary…