Computer vision publications

July 10, 2022

Dual-Key Multimodal Backdoors for Visual Question Answering

In this work, we show that multimodal networks are vulnerable to a novel type of attack that we refer to as Dual-Key Multimodal Backdoors.

Computer vision publications, Machine learning publications, Publications
June 19, 2022

Saccade Mechanisms for Image Classification, Object Detection and Tracking

ByDavid Zhang

We examine how the saccade mechanism from biological vision can be used to make deep neural networks more efficient for classification and object detection problems.

Computer vision publications, Machine learning publications, Publications
June 8, 2022

Conformal Prediction Intervals for Markov Decision Process Trajectories

ByJesse Hostetler

This paper extends previous work on conformal prediction for functional data and conformalized quantile regression to provide conformal prediction intervals over the future behavior of an autonomous system executing a…

Computer vision publications, Machine learning publications, Publications
June 8, 2022

Optimized Simultaneous Aided Target Detection and Imagery based Navigation in GPS-Denied Environments

ByHan-Pang Chiu, Supun Samarasekera, Rakesh Kumar

We describe and demonstrate a comprehensive optimized vision-based real-time solution to provide SATIN capabilities for current and future UAS in GPS-denied environments.

2d-3d reasoning and augmented reality publications, Computer vision publications, Publications
June 8, 2022

Conformal Prediction Intervals for Markov Decision Process Trajectories

ByJesse Hostetler

This paper extends previous work on conformal prediction for functional data and conformalized quantile regression to provide conformal prediction intervals over the future behavior of an autonomous system executing a…

Computer vision publications, Machine learning publications, Publications
June 6, 2022

Cross-View and Cross-Modal Visual Geo-Localization for Augmented Reality and Robot/ Vehicle Navigation Applications

ByRakesh Kumar, Supun Samarasekera, Han-Pang Chiu

We will present methods and results for estimation of geo-location and/ or orientation for dismounts and platforms for wide area, outdoor augmented reality and other applications under GPS denied/ challenged…

2d-3d reasoning and augmented reality publications, Computer vision publications, Publications
May 27, 2022

Time-Space Processing for Small Ship Detection in SAR

ByYi Yao

This paper presents a new 3D time-space detector for small ships in single look complex (SLC) synthetic aperture radar (SAR) imagery, optimized for small targets around 5-15 m long that…

Computer vision publications, Multi-modal data analytics publications, Publications
May 18, 2022

Striking the Right Balance: Recall Loss for Semantic Segmentation

ByHan-Pang Chiu

We propose a hard-class mining loss by reshaping the vanilla cross entropy loss such that it weights the loss for each class dynamically based on instantaneous recall performance.

2d-3d reasoning and augmented reality publications, Computer vision publications, Publications
May 18, 2022

Graph Mapper: Efficient Visual Navigation by Scene Graph Generation

ByHan-Pang Chiu, Supun Samarasekera, Rakesh Kumar

We propose a method to train an autonomous agent to learn to accumulate a 3D scene graph representation of its environment by simultaneously learning to navigate through said environment.

Collaborative human robot autonomy publications, Computer vision publications, Publications
May 18, 2022

SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments

ByHan-Pang Chiu, Supun Samarasekera, Rakesh Kumar

This paper presents a novel approach for the Vision-and-Language Navigation (VLN) task in continuous 3D environments.

Collaborative human robot autonomy publications, Computer vision publications, Publications
April 8, 2022

Broadening AI Ethics Narratives: An Indic Arts View

ByAjay Divakaran

We investigate uncovering the unique socio-cultural perspectives embedded in human-made art, which in turn, can be valuable in expanding the horizon of AI ethics.

Computer vision publications, Machine learning publications, Publications
March 18, 2022

Real-Time Hyper-Dimensional Reconfiguration at the Edge using Hardware Accelerators

ByGooitzen van der Wal, David Zhang, Michael Piacentino, Michael A. Isnardi

In this paper we present Hyper-Dimensional Reconfigurable Analytics at the Tactical Edge using low-SWaP embedded hardware that can perform real-time reconfiguration at the edge leveraging non-MAC deep neural nets (DNN)…

Computational sensing-low-power processing publications, Computer vision publications, Publications