Search Results for author: Harry Zhang

Found 11 papers, 1 paper with code

Multi-Model 3D Registration: Finding Multiple Moving Objects in Cluttered Point Clouds

no code implementations • 16 Feb 2024 • David Jin, Sushrut Karmalkar, Harry Zhang, Luca Carlone

We investigate a variation of the 3D registration problem, named multi-model 3D registration.

The Gift of Feedback: Improving ASR Model Quality by Learning from User Corrections through Federated Learning

no code implementations • 29 Sep 2023 • Lillian Zhou, Yuxin Ding, Mingqing Chen, Harry Zhang, Rohit Prabhavalkar, Dhruv Guliani, Giovanni Motta, Rajiv Mathews

Automatic speech recognition (ASR) models are typically trained on large datasets of transcribed speech.

Automatic Speech Recognition (ASR)

POLAR3D: Augmenting NASA's POLAR Dataset for Data-Driven Lunar Perception and Rover Simulation

1 code implementation • 21 Sep 2023 • Bo-Hsun Chen, Peter Negrut, Thomas Liang, Nevindu Batagoda, Harry Zhang, Dan Negrut

POLAR3D is the set of digital assets comprising rock/shadow labels and obj files associated with the digital twins of lunar terrain scenarios.

APLA: Additional Perturbation for Latent Noise with Adversarial Training Enables Consistency

no code implementations • 24 Aug 2023 • Yupu Yao, ShangQi Deng, ZiHan Cao, Harry Zhang, Liang-Jian Deng

One underlying cause is that traditional diffusion models approximate Gaussian noise distribution by utilizing predictive noise, without fully accounting for the impact of inherent information within the input itself.

Video Generation

DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification

no code implementations • 25 May 2023 • Sitian Shen, Zilin Zhu, Linqian Fan, Harry Zhang, Xinxiao Wu

Large pre-trained models have had a significant impact on computer vision by enabling multi-modal learning, where the CLIP model has achieved impressive results in image classification, object detection, and semantic segmentation.

3D Classification, Classification

FlowBot3D: Learning 3D Articulation Flow to Manipulate Articulated Objects

no code implementations • 9 May 2022 • Ben Eisner, Harry Zhang, David Held

We propose a vision-based system that learns to predict the potential motions of the parts of a variety of articulated objects to guide downstream motion planning of the system to articulate the objects.

Motion Planning

Enabling On-Device Training of Speech Recognition Models with Federated Dropout

no code implementations • 7 Oct 2021 • Dhruv Guliani, Lillian Zhou, Changwan Ryu, Tien-Ju Yang, Harry Zhang, Yonghui Xiao, Francoise Beaufays, Giovanni Motta

Federated learning can be used to train machine learning models on the edge on local data that never leave devices, providing privacy by default.

Automatic Speech Recognition (ASR)

Orienting Novel 3D Objects Using Self-Supervised Learning of Rotation Transforms

no code implementations • 29 May 2021 • Shivin Devgon, Jeffrey Ichnowski, Ashwin Balakrishna, Harry Zhang, Ken Goldberg

We formulate a self-supervised objective for this problem and train a deep neural network to estimate the 3D rotation, parameterized by a quaternion, between the current and desired depth images.

Self-Supervised Learning

Robots of the Lost Arc: Self-Supervised Learning to Dynamically Manipulate Fixed-Endpoint Cables

no code implementations • 10 Nov 2020 • Harry Zhang, Jeffrey Ichnowski, Daniel Seita, Jonathan Wang, Huang Huang, Ken Goldberg

The framework finds a 3D apex point for the robot arm, which, together with a task-specific trajectory function, defines an arcing motion that dynamically manipulates the cable to perform tasks with varying obstacle and target locations.

Self-Supervised Learning

6-DoF Grasp Planning using Fast 3D Reconstruction and Grasp Quality CNN

no code implementations • 18 Sep 2020 • Yahav Avigal, Samuel Paradis, Harry Zhang

Recent consumer demand for home robots has accelerated performance of robotic grasping.

3D Reconstruction, Robotic Grasping

Personalization of End-to-end Speech Recognition On Mobile Devices For Named Entities

no code implementations • 14 Dec 2019 • Khe Chai Sim, Françoise Beaufays, Arnaud Benard, Dhruv Guliani, Andreas Kabel, Nikhil Khare, Tamar Lucassen, Petr Zadrazil, Harry Zhang, Leif Johnson, Giovanni Motta, Lillian Zhou

With speech input, if the user corrects only the names, the name recall rate improves to 64.4%.

Speech Recognition
