Publications
-
Improved Modeling and Efficiency for Automatic Transcription of Broadcast News
In this paper, we report on our research and progress on the DARPA-sponsored Hub-4 continuous speech recognition evaluations, with an emphasis on efficient modeling.
-
Learning Chemistry Through The Use Of A Representation-Based Knowledge Building Environment
The ChemSense project is a multidisciplinary program of research and development to examine the impact of representational tools, chemical investigations, and discourse on chemistry learning and teaching in high schools…
-
The Pathway Tools Software
The Pathway Tools is a reusable, production-quality software environment for creating a type of model organism database (MOD) called a Pathway/Genome Database (PGDB).
-
Constrained optimization based control of real time large-scale systems: airjet object movement system
This paper demonstrates that hyper-redundant systems are capable of system self-identification, and that constrained optimization can effectively solve problems associated with control of many-element systems.
-
Multispeaker Speech Activity Detection for the ICSI Meeting Recorder
We have developed a more sophisticated approach for multichannel speech activity detection using a simple hidden Markov model (HMM).
-
Palm Education Pioneers Program Round I Preliminary Evaluation Report
Through the PEP program, Palm has equipped more than 175 classrooms throughout the United States with a handheld computer for every student. CTL’s research will help determine the impact that…
-
Capturing Analytic Thought
We are developing a new methodology that retains the ease of use, the familiarity, and (some of) the free-form nature of informal methods, while benefiting from the rigor, structure, and…
-
Modeling Word Durations
We describe a new method of modeling duration at word level. These duration models are easily trained from the acoustic training data and can be used to rescore N-best lists…
-
Prosody Modeling for Automatic Speech Understanding: An Overview of Recent Research at SRI
In this paper, we summarize recent work at SRI International in the area of computational prosody modeling, and results from several recognition tasks where prosodic knowledge proved to be of…
-
Can Prosody Aid the Automatic Processing of Multi-Party Meetings? Evidence from Predicting Punctuation, Disfluencies, and Overlapping Speech
We investigate whether probabilistic modeling of prosody can aid various automatic labeling tasks essential for processing of multi-party meetings.
-
The GeoWeb — A New Paradigm for Finding Data on the Web
We propose to build and maintain this open standards-based infrastructure on a new top-level domain called .geo that will enable anybody to publish and search for all metadata referring to…
-
Observations on Overlap: Findings and Implications for Automatic Processing of Multi-Party Conversation
We examine the distribution of overlapping speech in different corpora of natural multi-party conversations, including two types of meetings, and two corpora of telephone conversations.