Search Results for author: Harry Zhang

Found 23 papers, 5 papers with code

CUPS: Improving Human Pose-Shape Estimators with Conformalized Deep Uncertainty

no code implementations11 Dec 2024 Harry Zhang, Luca Carlone

We introduce CUPS, a novel method for learning sequence-to-sequence 3D human shapes and poses from RGB videos with uncertainty quantification.

Conformal Prediction Prediction +1

CRISP: Object Pose and Shape Estimation with Test-Time Adaptation

no code implementations2 Dec 2024 Jingnan Shi, Rajat Talak, Harry Zhang, David Jin, Luca Carlone

Our first contribution is to introduce CRISP, a category-agnostic object pose and shape estimation pipeline.

Decoder Pose Estimation +1

Enhancing Autonomous Navigation by Imaging Hidden Objects using Single-Photon LiDAR

1 code implementation4 Oct 2024 Aaron Young, Nevindu M. Batagoda, Harry Zhang, Akshat Dave, Adithya Pediredla, Dan Negrut, Ramesh Raskar

We present a novel approach that leverages Non-Line-of-Sight (NLOS) sensing using single-photon LiDAR to improve visibility and enhance autonomous navigation.

Autonomous Navigation

CHAMP: Conformalized 3D Human Multi-Hypothesis Pose Estimators

no code implementations27 May 2024 Harry Zhang, Luca Carlone

This process results in a differentiable conformal predictor that is trained end2end with the 3D pose estimator.

Conformal Prediction Prediction

Safe Deep Model-Based Reinforcement Learning with Lyapunov Functions

1 code implementation25 May 2024 Harry Zhang

The new stability-augmented framework consists of a neural-network-based learner that learns to construct a Lyapunov function, and a model-based RL agent to consistently complete the tasks while satisfying user-specified constraints given only sub-optimal demonstrations and sparse-cost feedback.

Model-based Reinforcement Learning Model Predictive Control +2

Theoretical Analysis for Expectation-Maximization-Based Multi-Model 3D Registration

no code implementations14 May 2024 David Jin, Harry Zhang, Kai Chang

We perform detailed theoretical analysis of an expectation-maximization-based algorithm recently proposed in for solving a variation of the 3D registration problem, named multi-model 3D registration.

Learning Correspondence for Deformable Objects

no code implementations14 May 2024 Priya Sundaresan, Aditya Ganapathi, Harry Zhang, Shivin Devgon

We investigate the problem of pixelwise correspondence for deformable objects, namely cloth and rope, by comparing both classical and learning-based methods.

Object Object Tracking

Multi-Model 3D Registration: Finding Multiple Moving Objects in Cluttered Point Clouds

no code implementations16 Feb 2024 David Jin, Sushrut Karmalkar, Harry Zhang, Luca Carlone

We investigate a variation of the 3D registration problem, named multi-model 3D registration.

APLA: Additional Perturbation for Latent Noise with Adversarial Training Enables Consistency

no code implementations24 Aug 2023 Yupu Yao, ShangQi Deng, ZiHan Cao, Harry Zhang, Liang-Jian Deng

One underlying cause is that traditional diffusion models approximate Gaussian noise distribution by utilizing predictive noise, without fully accounting for the impact of inherent information within the input itself.

Video Generation

DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification

no code implementations25 May 2023 Sitian Shen, Zilin Zhu, Linqian Fan, Harry Zhang, Xinxiao wu

Large pre-trained models have had a significant impact on computer vision by enabling multi-modal learning, where the CLIP model has achieved impressive results in image classification, object detection, and semantic segmentation.

3D Classification Classification +5

TAX-Pose: Task-Specific Cross-Pose Estimation for Robot Manipulation

1 code implementation17 Nov 2022 Chuer Pan, Brian Okorn, Harry Zhang, Ben Eisner, David Held

We conjecture that this relationship is a generalizable notion of a manipulation task that can transfer to new objects in the same category; examples include the relationship between the pose of a pan relative to an oven or the pose of a mug relative to a mug rack.

Pose Estimation Robot Manipulation

FlowBot3D: Learning 3D Articulation Flow to Manipulate Articulated Objects

no code implementations9 May 2022 Ben Eisner, Harry Zhang, David Held

We propose a vision-based system that learns to predict the potential motions of the parts of a variety of articulated objects to guide downstream motion planning of the system to articulate the objects.

Motion Planning

Orienting Novel 3D Objects Using Self-Supervised Learning of Rotation Transforms

no code implementations29 May 2021 Shivin Devgon, Jeffrey Ichnowski, Ashwin Balakrishna, Harry Zhang, Ken Goldberg

We formulate a self-supervised objective for this problem and train a deep neural network to estimate the 3D rotation as parameterized by a quaternion, between these current and desired depth images.

Self-Supervised Learning

Robots of the Lost Arc: Self-Supervised Learning to Dynamically Manipulate Fixed-Endpoint Cables

no code implementations10 Nov 2020 Harry Zhang, Jeffrey Ichnowski, Daniel Seita, Jonathan Wang, Huang Huang, Ken Goldberg

The framework finds a 3D apex point for the robot arm, which, together with a task-specific trajectory function, defines an arcing motion that dynamically manipulates the cable to perform tasks with varying obstacle and target locations.

ARC Self-Supervised Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.