SenSay™ Analytics speaker state platform


Sensing what people say, and how they say it.

SenSay Analytics is platform that performs real-time speaker state classification from spoken audio. SRI is working with research and industry partners using SenSay Analytics to estimate speaker state such as emotion, sentiment, cognition, health, mental health and communication quality in a range of end applications including:

  • Personal assistants
  • Health monitoring
  • Service robots
  • Interpersonal skills
  • Driver monitoring
  • Gaming
  • Education
  • Marketing/advertising

How it works

At sub-second intervals, the SenSay platform updates both features and class estimates using advanced signal features that capture spectral, prosodic, articulatory, auditory, discourse and fluency characteristics, as well as features designed specifically for robustness to noise and reverberation.

SenSay can analyze the features from the signal alone or combined with automatic speech recognition to model word-based information via sentiment models. Features are modeled using state-of-the-art machine learning approaches appropriate to the task, training data and application constraints.

SenSay advantages


Platform provides class and feature updates at < 1 second; crucial for applications such as driver monitoring, dialog system response and customer service.

Control, privacy

SenSay can run in cloud, on laptop/desktop or in client-hosted environment.


Deploy on premises or in the cloud. Use as a feature extractor, class predictor or both. Use with or without automatic speech recognition.


APIs let clients add additional sensor capabilities such as video-based or physiological features.


Platform can be adapted to task, domain, language, single- or multi-party conversations.


SenSay is architected to support simultaneous live streams.


For more information or to work with us, e-mail

Read more from SRI