no code implementations • 16 Feb 2024 • David Jin, Sushrut Karmalkar, Harry Zhang, Luca Carlone
We investigate a variation of the 3D registration problem, named multi-model 3D registration.
no code implementations • 29 Sep 2023 • Lillian Zhou, Yuxin Ding, Mingqing Chen, Harry Zhang, Rohit Prabhavalkar, Dhruv Guliani, Giovanni Motta, Rajiv Mathews
Automatic speech recognition (ASR) models are typically trained on large datasets of transcribed speech.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
1 code implementation • 21 Sep 2023 • Bo-Hsun Chen, Peter Negrut, Thomas Liang, Nevindu Batagoda, Harry Zhang, Dan Negrut
POLAR3D is the set of digital assets comprising of rock/shadow labels and obj files associated with the digital twins of lunar terrain scenarios.
no code implementations • 24 Aug 2023 • Yupu Yao, ShangQi Deng, ZiHan Cao, Harry Zhang, Liang-Jian Deng
One underlying cause is that traditional diffusion models approximate Gaussian noise distribution by utilizing predictive noise, without fully accounting for the impact of inherent information within the input itself.
no code implementations • 25 May 2023 • Sitian Shen, Zilin Zhu, Linqian Fan, Harry Zhang, Xinxiao wu
Large pre-trained models have had a significant impact on computer vision by enabling multi-modal learning, where the CLIP model has achieved impressive results in image classification, object detection, and semantic segmentation.
no code implementations • 9 May 2022 • Ben Eisner, Harry Zhang, David Held
We propose a vision-based system that learns to predict the potential motions of the parts of a variety of articulated objects to guide downstream motion planning of the system to articulate the objects.
no code implementations • 7 Oct 2021 • Dhruv Guliani, Lillian Zhou, Changwan Ryu, Tien-Ju Yang, Harry Zhang, Yonghui Xiao, Francoise Beaufays, Giovanni Motta
Federated learning can be used to train machine learning models on the edge on local data that never leave devices, providing privacy by default.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 29 May 2021 • Shivin Devgon, Jeffrey Ichnowski, Ashwin Balakrishna, Harry Zhang, Ken Goldberg
We formulate a self-supervised objective for this problem and train a deep neural network to estimate the 3D rotation as parameterized by a quaternion, between these current and desired depth images.
no code implementations • 10 Nov 2020 • Harry Zhang, Jeffrey Ichnowski, Daniel Seita, Jonathan Wang, Huang Huang, Ken Goldberg
The framework finds a 3D apex point for the robot arm, which, together with a task-specific trajectory function, defines an arcing motion that dynamically manipulates the cable to perform tasks with varying obstacle and target locations.
no code implementations • 18 Sep 2020 • Yahav Avigal, Samuel Paradis, Harry Zhang
Recent consumer demand for home robots has accelerated performance of robotic grasping.
no code implementations • 14 Dec 2019 • Khe Chai Sim, Françoise Beaufays, Arnaud Benard, Dhruv Guliani, Andreas Kabel, Nikhil Khare, Tamar Lucassen, Petr Zadrazil, Harry Zhang, Leif Johnson, Giovanni Motta, Lillian Zhou
With speech input, if the user corrects only the names, the name recall rate improves to 64. 4%.