We present an approach for autonomous sensor control for information gathering under partially observable, dynamic and sparsely sampled environments.
Computer vision publications
Towards Understanding Confusion and Affective States Under Communication Failures in Voice-Based Human-Machine Interaction
We present a series of two studies conducted to understand user’s affective states during voice-based human-machine interactions. Emphasis is placed on the cases of communication errors or failures.
Incremental Learning with Differentiable Architecture and Forgetting Search
In this paper, we show that leveraging NAS for incremental learning results in strong performance gains for classification tasks.
Saccade Mechanisms for Image Classification, Object Detection and Tracking
We examine how the saccade mechanism from biological vision can be used to make deep neural networks more efficient for classification and object detection problems. Our proposed approach is based on the ideas of attention-driven visual processing and saccades, miniature eye movements influenced by attention.
Conformal Prediction Intervals for Markov Decision Process Trajectories
This paper extends previous work on conformal prediction for functional data and conformalized quantile regression…
Time-Space Processing for Small Ship Detection in SAR
This paper presents a new 3D time-space detector for small ships in single look complex (SLC) synthetic aperture radar (SAR) imagery, optimized for small targets around 5-15 m long that are unfocused due to target motion induced by ocean surface waves.
Striking the Right Balance: Recall Loss for Semantic Segmentation
Class imbalance is a fundamental problem in computer vision applications such as semantic segmentation.
Graph Mapper: Efficient Visual Navigation by Scene Graph Generation
We propose a method to train an autonomous agent to learn to accumulate a 3D scene graph representation of its environment by simultaneously learning to navigate through said environment.
SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments
This paper presents a novel approach for the Vision-and-Language Navigation (VLN) task in continuous 3D environments.