Bios September 8, 2021

Ajay Divakaran

Senior Technical Director, Vision and Learning Laboratory, Center for Vision Technologies

Ajay Divakaran, Ph.D., is technical director of the Vision and Learning Laboratory in SRI International’s Center for Vision Technologies. In this role, he is responsible for the proposal and execution of contract research projects in computer vision as well as multi-sensor systems that combine various modalities.

Divakaran is currently the principal investigator for a number of SRI research projects. His work includes social multimedia (video-audio-text) analytics; multimodal modeling and analysis of affective, cognitive, and physiological aspects of human behavior; interactive virtual reality-based training; applied machine learning; tracking of individuals in dense crowds and multi-camera tracking; and audio analysis for event detection in open-source video. Over the course of his career, he has developed several innovative technologies for multimodal systems in both commercial and government programs.

Prior to joining SRI in 2008, Divakaran worked at Mitsubishi Electric Research Labs for 10 years, where he was the lead inventor of the world’s first sports highlights playback-enabled DVR. He also oversaw a wide variety of product applications for machine learning.

Divakaran was named a Fellow of the IEEE in 2011 for his contributions to multimedia content analysis. As part of his work on automatic highlights extraction from broadcast sports video, he developed techniques for recognizing agitated speech. As lead inventor of the MPEG-7 video standard motion activity descriptor, he established a sound experimental and theoretical framework for human perception of action in video sequences. He serves on the technical program committees of key multimedia conferences and served as an associate editor of IEEE Transactions on Multimedia from 2007 to 2010. He has authored two books and has more than 100 publications and more than 50 issued patents to his credit.

Divakaran received his M.S. and Ph.D. degrees in electrical engineering from Rensselaer Polytechnic Institute. His B.E. in electronics and communication engineering is from the University of Jodhpur in India.

Ajay Divakaran talks about big data, social media influence and robotic navigation on The Dish TV

Recent publications

  • Towards Understanding Confusion and Affective States Under Communication Failures in Voice-Based Human-Machine Interaction

    We present a series of two studies conducted to understand users’ affective states during voice-based human-machine interactions.

  • Broadening AI Ethics Narratives: An Indic Arts View

    We investigate uncovering the unique socio-cultural perspectives embedded in human-made art, which, in turn, can be valuable in expanding the horizon of AI ethics.

  • Model-Free Generative Replay For Lifelong Reinforcement Learning: Application To Starcraft-2

    We evaluate our proposed algorithms on three different scenarios comprising tasks from the Starcraft 2 and Minigrid domains.

  • Generating and Evaluating Explanations of Attended and Error-Inducing Input Regions for VQA Models

    Error maps can indicate when a correctly attended region may be processed incorrectly leading to an incorrect answer, and hence, improve users’ understanding of those cases.

  • Challenges in Procedural Multimodal Machine Comprehension: A Novel Way to Benchmark

    We identify three critical biases stemming from the question-answer generation process and memorization capabilities of large deep models.


Copyright © 2023 SRI International