SRI authors: Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar Abstract Understanding the geometric relationships between objects in a scene is a core capability in enabling both humans and autonomous agents to navigate in new environments. A sparse, unified representation of the scene topology will allow agents to act efficiently to move through their environment, communicate the […]
SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments
SRI authors: Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar Abstract This paper presents a novel approach for the Vision-and-Language Navigation (VLN) task in continuous 3D environments, which requires an autonomous agent to follow natural language instructions in unseen environments. Existing end-to-end learning-based VLN methods struggle at this task as they focus mostly on utilizing raw visual […]
Dr. Yao loves to explore new frontiers, and SRI loves to innovate, allowing her to explore and advance AI-enabled tech
Blurring the lines between humans and computers.
BASF, the world’s largest chemical producer, identified Computer Vision as a critical technology for addressing a significant number of its global and societal challenges.
This scientific trailblazer is on a quest to shape the digital age at SRI.
Several developments in robotic and vehicle navigation in GPS-denied scenarios
Researchers from UTSA, UCF, AFRL and SRI have developed a new method that improves how AI learns to see