SRI authors: Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar. Abstract: Understanding the geometric relationships between objects in a scene is a core capability in enabling both humans and autonomous agents to navigate in new environments. A sparse, unified representation of the scene topology will allow agents to act efficiently to move through their environment, communicate the […]
Computer vision
SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments
SRI authors: Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar. Abstract: This paper presents a novel approach for the Vision-and-Language Navigation (VLN) task in continuous 3D environments, which requires an autonomous agent to follow natural language instructions in unseen environments. Existing end-to-end learning-based VLN methods struggle at this task as they focus mostly on utilizing raw visual […]
Featured Innovator: Yi Yao
Dr. Yao loves to explore new frontiers, and SRI's focus on innovation lets her advance AI-enabled technology.
75 Years of Innovation: Computer vision
Blurring the lines between humans and computers.
Seeing the things that matter most
BASF selects SRI International to help refine and improve computer vision applications
BASF, the world’s largest chemical producer, identified computer vision as a critical technology for addressing a significant number of its global and societal challenges.
Featured Innovator: Rakesh “Teddy” Kumar
This scientific trailblazer is on a quest to shape the digital age at SRI.
75 Years of Innovation: GPS-denied navigation
Several developments in robotic and vehicle navigation in GPS-denied scenarios
Improving computer vision for AI
Researchers from UTSA, UCF, AFRL, and SRI have developed a new method that improves how AI learns to see.