Computer vision publications

April 1, 2021

Modular Adaptation for Cross-Domain Few-Shot Learning

By Ajay Divakaran, Yi Yao

While the literature has demonstrated great success with representation learning, in this work we show that downstream tasks can also be improved by an appropriate design of the adaptation process.

Computer vision publications, Machine learning publications, Publications
April 1, 2021

Confidence Calibration for Domain Generalization under Covariate Shift

By Yi Yao, Ajay Divakaran, Melinda Gervasio

We present novel calibration solutions via domain generalization. Our core idea is to leverage multiple calibration domains to reduce the effective distribution disparity between the target and calibration domains for…

Computer vision publications, Machine learning publications, Publications
November 19, 2020

Hybrid Consistency Training with Prototype Adaptation for Few-Shot Learning

By Yi Yao, Ajay Divakaran

We introduce Hybrid Consistency Training to jointly leverage interpolation consistency (including interpolation of hidden features), which imposes locally linear behavior, and data augmentation consistency, which learns embeddings that are robust to sample variations.

Computer vision publications, Machine learning publications, Publications
October 12, 2020

RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization

By Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar

We study an important, yet largely unexplored problem of large-scale cross-modal visual localization by matching ground RGB images to a geo-referenced aerial LIDAR 3D point cloud.

2d-3d reasoning and augmented reality publications, Computer vision publications, Publications
July 14, 2020

Lifelong learning using Eigentasks: Task separation, skill acquisition, and selective transfer

By Ajay Divakaran, Jesse Hostetler

We introduce the eigentask framework for lifelong learning. An eigentask pairs a skill that solves a set of related tasks with a generative model that can…

Computer vision publications, Machine learning publications, Publications
March 16, 2020

Deep Adaptive Semantic Logic (DASL): Compiling Declarative Knowledge into Deep Neural Networks

By Andrew Silberfarb, John Byrnes, Ajay Divakaran

We introduce Deep Adaptive Semantic Logic (DASL), a novel framework for automating the generation of deep neural networks that incorporates user-provided formal knowledge to improve learning from data.

Computer vision publications, Multi-modal data analytics publications, Publications
December 16, 2019

Bit Efficient Quantization for Deep Neural Networks

By David Zhang

In this paper, we present a comparison of model-parameter driven quantization approaches that can achieve as low as 3-bit precision without affecting accuracy.

Computational sensing-low-power processing publications, Computer vision publications, Publications
September 9, 2019

Semantically-Aware Attentive Neural Embeddings for 2D Long-Term Visual Localization

By Han-Pang Chiu, Supun Samarasekera, Rakesh Kumar

We present an approach that combines appearance and semantic information for 2D image-based localization (2D-VL) across large perceptual changes and time lags.

2d-3d reasoning and augmented reality publications, Computer vision publications, Publications
September 2, 2019

Multi-Sensor Fusion for Motion Estimation in Visually-Degraded Environments

By Han-Pang Chiu, Supun Samarasekera

This paper analyzes the feasibility of utilizing multiple low-cost on-board sensors for ground robots or drones navigating in visually-degraded environments.

2d-3d reasoning and augmented reality publications, Computer vision publications, Publications
June 15, 2019

Stacked Spatio-Temporal Graph Convolutional Networks for Action Segmentation

By Ajay Divakaran, Yi Yao

We propose novel Stacked Spatio-Temporal Graph Convolutional Networks (Stacked-STGCN) for action segmentation, i.e., predicting and localizing a sequence of actions over long videos.

Computer vision publications, Multi-modal data analytics publications, Publications
April 11, 2019

Toward Runtime Throttleable Neural Networks

By Jesse Hostetler

This paper presents an approach to creating runtime-throttleable NNs that can adaptively balance performance and resource use in response to a control signal.

Computer vision publications, Machine learning publications, Publications
April 2, 2019

Lucid Explanations Help: Using a Human-AI Image-Guessing Game to Evaluate Machine Explanation Helpfulness

By Ajay Divakaran, Yi Yao

We propose a Twenty-Questions style collaborative image retrieval game as a method of evaluating the efficacy of explanations (visual evidence or textual justification) in the context of Visual Question Answering.

Artificial intelligence publications, Computer vision publications, Machine learning publications, Publications