Search Results for author: Bryan Bo Cao

Found 10 papers, 4 papers with code

Memory Proxy Maps for Visual Navigation

no code implementations15 Nov 2024 Faith Johnson, Bryan Bo Cao, Ashwin Ashok, Shubham Jain, Kristin Dana

Key to our approach is a memory proxy map (MPM), an intermediate representation of the environment learned in a self-supervised manner by the high-level manager agent that serves as a simplified memory, approximating what the agent has seen.

Navigate Visual Navigation

Few-Class Arena: A Benchmark for Efficient Selection of Vision Models and Dataset Difficulty Measurement

1 code implementation2 Nov 2024 Bryan Bo Cao, Lawrence O'Gorman, Michael Coss, Shubham Jain

FCA offers a new tool for efficient machine learning in the Few-Class Regime, with goals ranging from a new efficient class similarity proposal, to lightweight model architecture design, to a new scaling law.

Image Classification

A Lightweight Measure of Classification Difficulty from Application Dataset Characteristics

no code implementations9 Apr 2024 Bryan Bo Cao, Abhinav Sharma, Lawrence O'Gorman, Michael Coss, Shubham Jain

We show how a practitioner can use this measure to help select an efficient model 6 to 29x faster than through repeated training and testing.

A Landmark-Aware Visual Navigation Dataset

no code implementations22 Feb 2024 Faith Johnson, Bryan Bo Cao, Kristin Dana, Shubham Jain, Ashwin Ashok

However, recent advancements in the visual navigation field face challenges due to the lack of human datasets in the real world for efficient supervised representation learning of the environments.

Representation Learning Visual Navigation

Feudal Networks for Visual Navigation

no code implementations19 Feb 2024 Faith Johnson, Bryan Bo Cao, Ashwin Ashok, Shubham Jain, Kristin Dana

We introduce a new approach to visual navigation using feudal learning, which employs a hierarchical structure consisting of a worker agent, a mid-level manager, and a high-level manager.

Navigate Visual Navigation

ViFiT: Reconstructing Vision Trajectories from IMU and Wi-Fi Fine Time Measurements

1 code implementation MobiCom ISACom 2023 Bryan Bo Cao, Abrar Alali, Hansi Liu, Nicholas Meegan, Marco Gruteser, Kristin Dana, Ashwin Ashok, Shubham Jain

Tracking subjects in videos is one of the most widely used functions in camera-based IoT applications such as security surveillance, smart city traffic safety enhancement, vehicle to pedestrian communication and so on.

Data-Side Efficiencies for Lightweight Convolutional Neural Networks

no code implementations24 Aug 2023 Bryan Bo Cao, Lawrence O'Gorman, Michael Coss, Shubham Jain

We examine how the choice of data-side attributes for two important visual tasks of image classification and object detection can aid in the choice or design of lightweight convolutional neural networks.

Image Classification Metric Learning +3

Vi-Fi: Associating Moving Subjects across Vision and Wireless Sensors

1 code implementation ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN) 2022 Hansi Liu, Abrar Alali, Mohamed Ibrahim, Bryan Bo Cao, Nicholas Meegan, Hongyu Li, Marco Gruteser, Shubham Jain, Kristin Dana, Ashwin Ashok, Bin Cheng, HongSheng Lu

In this paper, we present Vi-Fi, a multi-modal system that leverages a user’s smartphone WiFi Fine Timing Measurements (FTM) and inertial measurement unit (IMU) sensor data to associate the user detected on a camera footage with their corresponding smartphone identifier (e. g. WiFi MAC address).

Graph Matching Multimodal Association

Cannot find the paper you are looking for? You can Submit a new open access paper.