Author: SRI International
-
Prosody Modeling for Automatic Speech Recognition and Understanding
This paper summarizes statistical modeling approaches for the use of prosody (the rhythm and melody of speech) in automatic recognition and understanding of speech.
-
Constrained optimization based control of real time large-scale systems: airjet object movement system
This paper demonstrates that hyper-redundant systems are capable of system self-identification, and that constrained optimization can effectively solve problems associated with control of many-element systems.
-
Multispeaker Speech Activity Detection for the ICSI Meeting Recorder
We have developed a more sophisticated approach for multichannel speech activity detection using a simple hidden Markov model (HMM).
-
Prosody Modeling for Automatic Speech Understanding: An Overview of Recent Research at SRI
In this paper, we summarize recent work at SRI International in the area of computational prosody modeling, and results from several recognition tasks where prosodic knowledge proved to be of help
-
Modeling Word Durations
We describe a new method of modeling duration at word level. These duration models are easily trained from the acoustic training data and can be used to rescore N-best lists of recognition hypotheses.
-
Can Prosody Aid the Automatic Processing of Multi-Party Meetings? Evidence from Predicting Punctuation, Disfluencies,and Overlapping Speech
We investigate whether probabilistic modeling of prosody can aid various automatic labeling tasks essential for processing of multi-party meetings.
-
Palm Education Pioneers Program Round I Preliminary Evaluation Report
Through the PEP program, Palm has equipped more than 175 classrooms throughout the United States with a handheld computer for every student. CTL’s research will help determine the impact that handheld technologies can have on teaching and learning.
-
Handheld Computers In Education: Current Trends And Future Research
-
Observations on Overlap: Findings and Implications for Automatic Processing of Multi-Party Conversation
We examine the distribution of overlapping speech in different corpora of natural multi-party conversations, including two types of meetings, and two corpora of telephone conversations.
-
A Framework for Robust 3-D Change Detection
We present an application of our framework for 3-D object-centered change detection to combined satellite and aerial imagery.
-
The Need For A Coordinated Engineering Discipline For The Production Of Educational Software
-
GeoVRML: Open Web-based 3D Cartography
In this paper, we will concentrate on a few of the commercially available tools that support the GeoVRML format, and also describe some of the capabilities that this solution provides geoscientists for the purpose of integrating their geographic data directly into a threedimensional (3D) computer graphics scene graph.