• Skip to primary navigation
  • Skip to main content
SRI logo
  • About
    • Press room
    • Our history
  • Expertise
    • Advanced imaging systems
    • Artificial intelligence
    • Biomedical R&D services
    • Biomedical sciences
    • Computer vision
    • Cyber & formal methods
    • Education and learning
    • Innovation strategy and policy
    • National security
    • Ocean & space
    • Quantum
    • QED-C
    • Robotics, sensors & devices
    • Speech & natural language
    • Video test & measurement
  • Ventures
  • NSIC
  • Careers
  • Contact
  • 日本支社
Search
Close
Home » Archives for Mitchell McLaren » Page 4
Mitchell McLaren

Mitchell McLaren

Senior Computer Scientist, Speech Technology and Research Laboratory
Go to bio page

Publications

Speech & natural language publications November 1, 2013 Article

Quality Measure Functions for Calibration of Speaker Recognition Systems in Various Duration Conditions

SRI International, Mitchell McLaren

SRI Authors: Mitchell McLaren

Speech & natural language publications November 1, 2013 Conference Paper

Recent Developments in Voice Biometrics: Robustness and High Accuracy

Aaron Lawson, Mitchell McLaren

Recently, researchers have tackled difficult voice biometrics problems that resonate with the defense and research communities. These problems include non-ideal recording conditions that are frequently found in operational scenarios, such as noise, reverberation, degraded channels, and compressed audio. In this article, we highlight SRI’s innovations that resulted from the IARPA Biometrics Exploitation Science & Technology (BEST) and the DARPA Robust Automatic Transcription of Speech (RATS) programs, as well as SRI’s approach for codec degraded speech. We show how these advancements support the case for the biometrics community adopting the use of speaker recognition.

Speech & natural language publications August 1, 2013 Conference Paper

Modulation features for noise robust speaker identification

Horacio Franco, Martin Graciarena, Mitchell McLaren

In this paper, we present a robust acoustic feature on top of robust modeling techniques to further improve speaker identification performance.

Speech & natural language publications August 1, 2013 Conference Paper

A Noise-Robust System for NIST 2012 Speaker Recognition Evaluation

Martin Graciarena, Mitchell McLaren

The National Institute of Standards and Technology (NIST) 2012 speaker recognition evaluation posed several new challenges including noisy data, varying test-sample length and number of enrollment samples, and a new metric.

Speech & natural language publications August 1, 2013 Conference Paper

Adaptive Gaussian Backend for Robust Language Identification

Aaron Lawson, Mitchell McLaren

This paper proposes adaptive Gaussian backend (AGB), a novel approach to robust language identification (LID). In this approach, a given test sample is compared to language-specific training data in order to dynamically select data for a trial-specific language model. Discriminative AGB additionally weights the training data to maximize discrimination against the test segment. Evaluated on heavily degraded speech data, discriminative AGB provides relative improvements of up to 45% and 38% in equal error rates (EER) over the widely adopted Gaussian backend (GB) and neural network (NN) approaches to LID, respectively. Discriminative AGB also significantly outperforms those techniques at shorter test durations, while demonstrating robustness to limited training resources and to mismatch between training and testing speech duration. The efficacy of AGB is validated on clean speech data from National Institute of Standards and Technology (NIST) language recognition evaluation (LRE) 2009, on which it was found to provide improvements over the GB and NN approaches.

Speech & natural language publications August 1, 2013 Conference Paper

Improving Language Identification Robustness to Highly Channel-Degraded Speech through Multiple System Fusion

Aaron Lawson, Martin Graciarena, Mitchell McLaren

We describe a language identification system developed for robustess to noise conditions such as those encountered under the DARPA RATS program, which is focused on multi-channel audio collected in high noise conditions. Work presented here includes novel approaches to scoring iVectors, the introduction of several new acoustic and prosodic features for language identification, and discriminative file selection approaches to score calibration.  Further, we explore the use of Discrete Cosine Transforms (DCT) as a supplement to traditional context modeling with Shifted Delta Cepstrum (SDC) and fusion of multiple iVector systems based on Gaussian backends, neural networks, and adaptive Gaussian backend modeling.

Speech & natural language publications May 1, 2013 Conference Paper

Improving Speaker Identification Robustness to Highly Channel-Degraded Speech Through Multiple System Fusion

Mitchell McLaren, Martin Graciarena

This article describes our submission to the speaker identification (SID) evaluation for the first phase of the DARPA Robust Audio and Transcription of Speech (RATS) program. 

  • « Go to Previous Page
  • Go to page 1
  • Go to page 2
  • Go to page 3
  • Go to page 4

How can we help?

Once you hit send…

We’ll match your inquiry to the person who can best help you.

Expect a response within 48 hours.

Career call to action image

Make your own mark.

Search jobs

Our work

Case studies

Publications

Timeline of innovation

Areas of expertise

Institute

Leadership

Press room

Media inquiries

Compliance

Careers

Job listings

Contact

SRI Ventures

Our locations

Headquarters

333 Ravenswood Ave
Menlo Park, CA 94025 USA

+1 (650) 859-2000

Subscribe to our newsletter


日本支社
SRI International
  • Contact us
  • Privacy Policy
  • Cookies
  • DMCA
  • Copyright © 2022 SRI International