SRI International

January 1, 1998

MVIEWS: Multimodal Tools for the Video Analyst

SRI has developed MVIEWS, a system for annotating, indexing, extracting, and disseminating information from video streams for surveillance and intelligence applications. MVIEWS is implemented within the Open Agent Architecture, a distributed multiagent framework that enables rapid integration of component technologies.

January 1, 1998

Toward An Integration Of The Social And The Scientific; Observing, Modeling, And Promoting The Explanatory Coherence Of Reasoning

Is scientific reasoning more likely to employ formal tools and/or more likely to involve the vigilant search for disconfirmation—something that just plain folk do, but less frequently?

January 1, 1998

Discriminative Training of Minimum Cost Speaker Verification Systems

This paper presents a new training procedure for speaker verification systems. Results are presented from the 1997 NIST Speaker Recognition Evaluation corpus indicating that the VCF performance can be improved with this procedure.

January 1, 1998

Counselor Simulation Guide, Levels 1-3

January 1, 1998

Hybrid Planning: An Approach to Integrating Generative and Case-Based Planning

This paper describes ongoing research on the development of a hybrid planning system that integrates case-based reasoning (CBR) methods into SIPE-2, a generative planning system.

January 1, 1998

Connecting With The Twenty-First Century: Technology In Support Of Educational Reform

We present an alternative image of what schools might be like and a set of interlocking social, pedagogical and technological changes that could transform the educational enterprise.

January 1, 1998

The Mathematics Of Change And Variation From A Millennial Perspective: New Content, New Context

We shall examine large-scale trends in content changes and in context changes for learning and using mathematics. From this set of factors, we raise three questions.

January 1, 1998

Can Prosody Aid the Automatic Classification of Dialog Acts in Conversational Speech?

This study asks whether current approaches, which use mainly word information, could be improved by adding prosodic information. The study is based on more than 1000 conversations from the Switchboard corpus.

December 1, 1997

Automatic Detection of Discourse Structure for Speech Recognition and Understanding

We describe a new approach for statistical modeling and detection of discourse structure for natural conversational speech. Our model is based on 42 ‘Dialog Acts’ (DAs), such as question, answer, backchannel, agreement, disagreement, and apology.

November 1, 1997

Using Information Extraction to Improve Information Retrieval

The authors describe an approach to applying a particular kind of Natural Language Processing (NLP) system to the TREC routing task in Information Retrieval (IR).

September 1, 1997

A Prosody-Only Decision-Tree Model for Disfluency Detection

We have developed a disfluency detection method using decision tree classifiers that use only local and automatically extracted prosodic features. Because the model doesn’t rely on lexical information, it is widely applicable even when word recognition is unreliable.

September 1, 1997

Acoustic Clustering and Adaptation for Robust Speech Recognition

We describe an algorithm based on acoustic clustering and acoustic adaptation to significantly improve speech recognition performance. The method is particularly useful when speech from multiple speakers is to be recognized and the boundary between speakers is not known.

Author: SRI International