Publications

August 23, 2007

An Unfinished Canvas – Arts Education in the San Francisco Bay Area: A Supplementary Status Report

ByKatrina Woodworth

The research supporting An Unfinished Canvas was undertaken to document the status of arts education in California schools and assess the extent to which schools were meeting state goals for…

Education & learning publications, Publications
August 21, 2007

Textual inference logic: take two

This note describes a logical system based on concepts and contexts, whose aim is to serve as a representation language for meanings of natural language sentences.

Future Concepts division- publications, Publications
August 3, 2007

Assimilating IT in the workplace: a study of situated learning

Publications
August 1, 2007

Co-training Using Prosodic and Lexical Information for Sentence Segmentation

We investigate the application of the co-training learning algorithm on the sentence boundary classification problem by using lexical and prosodic information. Co-training is a semisupervised machine learning algorithm that uses…

Publications, Speech & natural language publications
August 1, 2007

A Smoothing Kernel for Spatially Related Features and Its Application to Speaker Verification

Most commonly used kernels are invariant to permutations of the feature vector components. We will consider one such case, where the features are spatially related and show a way to…

Publications, Speech & natural language publications
August 1, 2007

The SRI/OGI 2006 Spoken Term Detection System

ByDimitra Vergyri

This paper describes the system developed jointly at SRI and OGI for participation in the 2006 NIST Spoken Term Detection (STD) evaluation.

Publications, Speech & natural language publications
August 1, 2007

Build IT: Girls Developing Information Technology Fluency Through Design. Annual Report Year 2

BuildIT is an after school and summer youth-based curriculum for low income middle school girls to develop IT fluency, interest in mathematics, and knowledge of IT careers.

Education & learning publications, Publications, STEM and computer science education publications
August 1, 2007

A Semi-Supervised Learning Approach for Morpheme Segmentation for an Arabic Dialect

We evaluate our approach by applying morpheme segmentation to the training data of a statistical machine translation (SMT) system. Experiments show that our approach is less sensitive to the availability…

Publications, Speech & natural language publications
August 1, 2007

IraqComm: A Next Generation Translation System

ByKristin Precoda, Dimitra Vergyri, Horacio Franco

This paper describes the IraqComm translation system that mediates and translates spontaneous conversations between an English speaker and a speaker of colloquial Iraqi Arabic.

Publications, Speech & natural language publications
August 1, 2007

Advances in Mandarin Broadcast Speech Recognition

We describe our continuing efforts to improve the UW-SRI-ICSI Mandarin broadcast speech recognizer.

Publications, Speech & natural language publications
August 1, 2007

Detecting Deception Using Critical Segments

We present an investigation of segments that map to GLOBAL LIES, that is, the intent to deceive with respect to salient topics of the discourse. We propose that identifying the…

Publications, Speech & natural language publications
August 1, 2007

Duration and Pronunciation Conditioned Lexical Modeling for Speaker Verification

We propose a method to improve speaker recognition lexical model performance using acoustic-prosodic information. More specifically, the lexical model is trained using duration- and pronunciation-conditioned word N-grams, simultaneously modeling lexical…

Publications, Speech & natural language publications