Publications
-
Integrating MAP, Marginals, and Unsupervised Language Model Adaptation
We investigate the integration of various language model adaptation approaches for a cross-genre adaptation task to improve Mandarin ASR system performance on a recently introduced genre, broadcast conversation (BC).
-
Co-training Using Prosodic and Lexical Information for Sentence Segmentation
We investigate the application of the co-training learning algorithm to the sentence boundary classification problem by using lexical and prosodic information. Co-training is a semisupervised machine learning algorithm that uses…
-
The SRI/OGI 2006 Spoken Term Detection System
This paper describes the system developed jointly at SRI and OGI for participation in the 2006 NIST Spoken Term Detection (STD) evaluation.
-
A Smoothing Kernel for Spatially Related Features and Its Application to Speaker Verification
Most commonly used kernels are invariant to permutations of the feature vector components. We will consider one such case, where the features are spatially related and show a way to…
-
Build IT: Girls Developing Information Technology Fluency Through Design. Annual Report Year 2
BuildIT is an after-school and summer youth-based curriculum for low-income middle school girls to develop IT fluency, interest in mathematics, and knowledge of IT careers.
-
A Semi-Supervised Learning Approach for Morpheme Segmentation for an Arabic Dialect
We evaluate our approach by applying morpheme segmentation to the training data of a statistical machine translation (SMT) system. Experiments show that our approach is less sensitive to the availability…
-
Leveraging graph locality via abstraction
The use of abstraction to speed up problem solving is ubiquitous in AI, especially in the field of heuristic search where abstraction has proven a crucial technique for creating highly accurate…
-
Regression testing for grammar-based systems
This paper describes best practices in two closely related regression testing frameworks used in grammar-based systems: MedSLT, a spoken language translation system based on the Regulus platform, and a search…
-
An LFG Chinese grammar for machine use
This paper describes the Chinese grammar developed at PARC, including its three basic components: the tokenizer and tagger, the lexicon, and the syntactic rules.
-
PARC’s Bridge question answering system
This paper describes a system designed to robustly map from natural language sentences to logical, abstract knowledge representations (the Bridge system).
-
Overlay mechanisms for multi-level deep processing applications
This paper discusses some engineering tools that are used in the XLE grammar development platform to allow for domain specialization.
-
Nanoscience: A Vehicle For A Goals-Oriented Science Education Final Report