Publications
-
Improved Discriminative Training Using Phone Lattices
We present an efficient discriminative training procedure utilizing phone lattices. Different approaches to expediting lattice generation, statistics collection, and convergence were studied.
-
Speech Translation for Low-Resource Languages: The Case of Pashto
We present a number of challenges and solutions that have arisen in the development of a speech translation system for American English and Pashto, highlighting those specific to a very…
-
Generation of fast interpreters for Huffman compressed bytecode
Our approach uses canonical Huffman codes to generate compact opcodes with custom-sized operand fields and with a virtual machine that directly executes this compact code. In effect, this automatically creates…
-
Leveraging Speaker-dependent Variation of Adaptation
This work introduces an automatic procedure for determining the size of regression class trees for individual speakers using an ensemble of speaker-level features to control the number of transformations, if…
-
A Personalized Time Management Assistant: Research Directions
This paper presents ongoing work to build the Personalized Time Manager (PTIME) system, a persistent assistant that builds on our previous work on a personalized calendar agent (PCalM) (Berry et…
-
A Robust Method for Tracking Scene Text in Video Imagery
We describe an approach that tracks planar regions of scene text that can undergo arbitrary 3-D rigid motion and scale changes. Our approach computes homographies on blocks of contiguous frames…
-
Collaborative and argumentative models of natural discussions
We report in this paper experiences and insights resulting from the first two years of work in two similar projects on meeting tracking and understanding. The projects are the DARPA-funded…
-
Identifying and Segmenting Human-Motion for Mobile Robot Navigation using alignment errors
This paper presents a new human-motion identification and segmentation algorithm from moving cameras. The algorithm is based on alignment error between pairs of moving object images. Pairs of object images…
-
Masquerade Detection via Customized Grammars
We use the Sequitur algorithm to generate a context-free grammar which efficiently extracts repetitive sequences of commands executed by one user – which is mainly used to generate a profile of the…
-
Evidence-Centered Assessment Design: Layers, Structures, and Terminology (Padi Technical Report 9)
This presentation provides an overview of ECD, highlighting the ideas of layers in the process, structures and representations within layers, and terms and concepts that can be used to guide…
-
Task Templates Based on Misconception Research (Padi Technical Report 6)
This paper reports one such effort, motivated by assessments that elicit students’ qualitative explanations of situations that have been designed to provoke misconceptions and partial understandings. We describe four task-specific…
-
Towards a Practical Stereo Vision Sensor
This paper describes an experimental framework for determining these limits using image processing algorithms, operating on graphically synthesized imagery, with performance envelope validation on real stereo image data.