Computer vision publications

May 18, 2022

Striking the Right Balance: Recall Loss for Semantic Segmentation

Han-Pang Chiu

We propose a hard-class mining loss by reshaping the vanilla cross entropy loss such that it weights the loss for each class dynamically based on instantaneous recall performance.

2d-3d reasoning and augmented reality publications, Computer vision publications, Publications
May 18, 2022

Graph Mapper: Efficient Visual Navigation by Scene Graph Generation

Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar

We propose a method to train an autonomous agent to learn to accumulate a 3D scene graph representation of its environment by simultaneously learning to navigate through said environment.

Collaborative human robot autonomy publications, Computer vision publications, Publications
May 18, 2022

SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments

Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar

This paper presents a novel approach for the Vision-and-Language Navigation (VLN) task in continuous 3D environments.

Collaborative human robot autonomy publications, Computer vision publications, Publications
April 8, 2022

Broadening AI Ethics Narratives: An Indic Arts View

Ajay Divakaran

We investigate uncovering the unique socio-cultural perspectives embedded in human-made art, which in turn, can be valuable in expanding the horizon of AI ethics.

Computer vision publications, Machine learning publications, Publications
March 18, 2022

Real-Time Hyper-Dimensional Reconfiguration at the Edge using Hardware Accelerators

Gooitzen van der Wal, David Zhang, Michael Piacentino, Michael A. Isnardi

In this paper we present Hyper-Dimensional Reconfigurable Analytics at the Tactical Edge using low-SWaP embedded hardware that can perform real-time reconfiguration at the edge leveraging non-MAC deep neural nets (DNN)…

Computational sensing-low-power processing publications, Computer vision publications, Publications
March 14, 2022

Model-Free Generative Replay For Lifelong Reinforcement Learning: Application To Starcraft-2

Jesse Hostetler, Michael Piacentino, Ajay Divakaran

We evaluate our proposed algorithms on three different scenarios comprising tasks from the Starcraft 2 and Minigrid domains.

Computer vision publications, Machine learning publications, Publications
March 12, 2022

Head-Worn Markerless Augmented Reality Inside a Moving Vehicle

Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar

This paper describes a system that provides general head-worn outdoor AR capability for the user inside a moving vehicle.

2d-3d reasoning and augmented reality publications, Computer vision publications, Publications
January 24, 2022

SIGNAV: Semantically-Informed GPS-Denied Navigation and Mapping in Visually-Degraded Environments

Han-Pang Chiu, Supun Samarasekera

We present SIGNAV, a real-time semantic SLAM system to operate in perceptually-challenging situations.

2d-3d reasoning and augmented reality publications, Computer vision publications, Publications
October 25, 2021

Generating and Evaluating Explanations of Attended and Error-Inducing Input Regions for VQA Models

Ajay Divakaran, Yi Yao

Error maps can indicate when a correctly attended region may be processed incorrectly leading to an incorrect answer, and hence, improve users’ understanding of those cases.

Computer vision publications, Machine learning publications, Publications
October 22, 2021

Challenges in Procedural Multimodal Machine Comprehension: A Novel Way to Benchmark

Ajay Divakaran

We identify three critical biases stemming from the question-answer generation process and memorization capabilities of large deep models.

Computer vision publications, Machine learning publications, Publications
October 8, 2021

Global Heading Estimation for Wide Area Augmented Reality Using Road Semantics for Geo-referencing

Supun Samarasekera, Rakesh Kumar

We present a method to estimate global camera heading by associating directional information from road segments in the camera view with annotated satellite imagery.

2d-3d reasoning and augmented reality publications, Computer vision publications, Publications
August 27, 2021

Long-Range Augmented Reality with Dynamic Occlusion Rendering

Supun Samarasekera, Han-Pang Chiu, Rakesh Kumar

This paper addresses the problem of fast and accurate dynamic occlusion reasoning by real objects in the scene for large scale outdoor AR applications.

2d-3d reasoning and augmented reality publications, Computer vision publications, Publications