Author: SRI International
-
Late Fusion and Calibration for Multimedia Event Detection Using Few Examples
In this paper, we present two parametric approaches to late fusion: a normalization scheme for arithmetic mean fusion (logistic averaging) and a fusion scheme based on logistic regression, and compare them to widely used rule-based fusion schemes.
-
Unscented Transform for iVector-Based Noisy Speaker Recognition
This paper proposes replacing the first-order VTS with an unscented transform in which, unlike VTS, the nonlinear function is applied not to the clean model parameters directly but to a set of sampled points.
-
Effective Use of DCTs for Contextualizing Features for Speaker Recognition
This article proposes a new approach for contextualizing features for speaker recognition through the discrete cosine transform (DCT).
-
Studies of a Prototype Linear Stationary X-Ray Source for Tomosynthesis Imaging
A prototype linear x-ray source to implement stationary source–stationary detector tomosynthesis (TS) imaging has been studied.
-
Automatic Characterization of Speaking Styles in Educational Videos
We use crowd-sourcing to explore speaking style dimensions in online educational videos, and then propose techniques based solely on acoustic features for automatically identifying a subset of the dimensions.
-
Blended Learning Report
With funding from the Michael & Susan Dell Foundation, SRI’s Center for Technology in Learning studied the adoption of blended learning models in selected schools in California and Louisiana associated with five different charter management organizations during the 2011-12 school year. This research report presents the findings of this formative and summative research effort, including…
-
Computationally-Efficient Endpointing Features for Natural Spoken Interaction with Personal-Assistant Systems
We elicit personal-assistant speech using a recognizer with a dramatically increased endpoint threshold, and find frequent non-final pauses. Based on the new data, we develop low-cost acoustic features to discriminate non-final from final pauses.
-
Global Ethics and Virtual Worlds: Ensuring Functional Integrity in Transnational Research Studies
This paper examines a number of issues in this research context, with particular stress on the challenges posed by transnational experimental projects in virtual worlds and social networks.
-
3D Imaging Reveals Electrodynamics of Polar Cap Aurora
Rishbeth prizewinners Hanna Dahlgren and colleagues investigate the nature of an auroral arc appearing within the deep polar cap region.
-
Nicotine Dependence as a Moderator of Genetic Influences on Smoking Cessation Treatment Outcome
In this secondary analysis of clinical trial data, we examined nicotine dependence severity as a moderator of the effects of 1198 single nucleotide polymorphisms (SNPs) in 53 biologically relevant gene regions on smoking cessation outcomes.
-
ASR Error Detection Using Recurrent Neural Network Language Model and Complementary ASR
Our goal is to locate errors in an utterance so that the dialogue manager can pose appropriate clarification questions to the users.
-
Coordinated Ionospheric Observations Indicating Coupling between Preonset Flow Bursts and Waves That Lead to Substorm Onset
A critical, long-standing problem in substorm research is identification of the sequence of events leading to substorm expansion phase onset.