Author: Han-Pang Chiu

May 18, 2022

Striking the Right Balance: Recall Loss for Semantic Segmentation

We propose a hard-class mining loss that reshapes the vanilla cross-entropy loss so that the loss for each class is weighted dynamically based on its instantaneous recall performance.
May 18, 2022

Graph Mapper: Efficient Visual Navigation by Scene Graph Generation

We propose a method to train an autonomous agent to accumulate a 3D scene graph representation of its environment while simultaneously learning to navigate through that environment.
May 18, 2022

SASRA: Semantically-aware Spatio-temporal Reasoning Agent for Vision-and-Language Navigation in Continuous Environments

This paper presents a novel approach for the Vision-and-Language Navigation (VLN) task in continuous 3D environments.
March 12, 2022

Head-Worn Markerless Augmented Reality Inside a Moving Vehicle

This paper describes a system that provides general head-worn outdoor AR capability for the user inside a moving vehicle.
January 24, 2022

SIGNAV: Semantically-Informed GPS-Denied Navigation and Mapping in Visually-Degraded Environments

We present SIGNAV, a real-time semantic SLAM system designed to operate in perceptually challenging situations.
August 27, 2021

Long-Range Augmented Reality with Dynamic Occlusion Rendering

This paper addresses the problem of fast and accurate reasoning about dynamic occlusion by real objects in the scene for large-scale outdoor AR applications.
May 30, 2021

MaAST: Map Attention with Semantic Transformers for Efficient Visual Navigation

In this work, we design a novel approach that aims to perform better than or comparably to existing learning-based solutions under a clear time/computational budget.
October 12, 2020

RGB2LIDAR: Towards Solving Large-Scale Cross-Modal Visual Localization

We study the important yet largely unexplored problem of large-scale cross-modal visual localization by matching ground RGB images to a geo-referenced aerial LIDAR 3D point cloud.
September 9, 2019

Semantically-Aware Attentive Neural Embeddings for 2D Long-Term Visual Localization

We present an approach that combines appearance and semantic information for 2D image-based localization (2D-VL) across large perceptual changes and time lags.
September 2, 2019

Multi-Sensor Fusion for Motion Estimation in Visually-Degraded Environments

This paper analyzes the feasibility of utilizing multiple low-cost on-board sensors for ground robots or drones navigating in visually-degraded environments.
March 18, 2018

Augmented Reality Driving Using Semantic Geo-Registration

We propose a new approach that utilizes semantic information to register 2D monocular video frames to the world using 3D georeferenced data, for augmented reality driving applications.
October 16, 2017

Utilizing Semantic Visual Landmarks for Precise Vehicle Navigation

This paper presents a new approach for integrating semantic information for vision-based vehicle navigation.