no code implementations • 31 Oct 2024 • Zhenyu Jiang, Yuqi Xie, Kevin Lin, Zhenjia Xu, Weikang Wan, Ajay Mandlekar, Linxi Fan, Yuke Zhu
To this end, we introduce DexMimicGen, a large-scale automated data generation system that synthesizes trajectories from a handful of human demonstrations for humanoid robots with dexterous hands.
no code implementations • 16 Oct 2024 • Zhenyu Jiang, Yuqi Xie, Jinhan Li, Ye Yuan, Yifeng Zhu, Yuke Zhu
Humanoid robots, with their human-like embodiment, have the potential to integrate seamlessly into human environments.
no code implementations • 15 Oct 2024 • Jinhan Li, Yifeng Zhu, Yuqi Xie, Zhenyu Jiang, Mingyo Seo, Georgios Pavlakos, Yuke Zhu
We study the problem of teaching humanoid robots manipulation skills by imitating from single video demonstrations.
no code implementations • 19 Jun 2024 • Shuaiyi Huang, Mara Levy, Zhenyu Jiang, Anima Anandkumar, Yuke Zhu, Linxi Fan, De-An Huang, Abhinav Shrivastava
Sequential decision-making can be formulated as a text-conditioned video generation problem, where a video planner, guided by a text-defined goal, generates future frames visualizing planned actions, from which control actions are subsequently derived.
no code implementations • 22 Oct 2023 • Yifeng Zhu, Zhenyu Jiang, Peter Stone, Yuke Zhu
We introduce GROOT, an imitation learning method for learning robust policies with object-centric and 3D priors.
1 code implementation • 2 Oct 2023 • Hanwen Jiang, Zhenyu Jiang, Yue Zhao, QiXing Huang
Are camera poses necessary for multi-view 3D modeling?
no code implementations • 26 Sep 2023 • Zhenyu Jiang, Hanwen Jiang, Yuke Zhu
Incorporating semantic priors with self-supervised flow training, Doduo produces accurate dense correspondence robust to the dynamic changes of the scenes.
no code implementations • 2 Feb 2023 • Cheng-Chun Hsu, Zhenyu Jiang, Yuke Zhu
We demonstrate the effectiveness of our approach in both simulation and real-world scenes.
no code implementations • 11 Dec 2022 • Lu Wang, Bofu Tang, Feifei Liu, Zhenyu Jiang, Xianmei Meng
Objective: To systematically evaluate the value of endocytoscopy (ECS) in the diagnosis of early esophageal cancer (EC).
no code implementations • 8 Dec 2022 • Hanwen Jiang, Zhenyu Jiang, Kristen Grauman, Yuke Zhu
The reconstruction results under predicted poses are comparable to the ones using ground-truth poses.
no code implementations • 14 Mar 2022 • Bokui Shen, Zhenyu Jiang, Christopher Choy, Leonidas J. Guibas, Silvio Savarese, Anima Anandkumar, Yuke Zhu
Manipulating volumetric deformable objects in the real world, like plush toys and pizza dough, bring substantial challenges due to infinite shape variations, non-rigid motions, and partial observability.
no code implementations • CVPR 2022 • Zhenyu Jiang, Cheng-Chun Hsu, Yuke Zhu
We also apply Ditto to real-world objects and deploy the recreated digital twins in physical simulation.
1 code implementation • 12 May 2021 • Yansong Tang, Zhenyu Jiang, Zhenda Xie, Yue Cao, Zheng Zhang, Philip H. S. Torr, Han Hu
Previous cycle-consistency correspondence learning methods usually leverage image patches for training.
1 code implementation • 4 Apr 2021 • Zhenyu Jiang, Yifeng Zhu, Maxwell Svetlik, Kuan Fang, Yuke Zhu
The experimental results in simulation and on the real robot have demonstrated that the use of implicit neural representations and joint learning of grasp affordance and 3D reconstruction have led to state-of-the-art grasping results.
1 code implementation • 3 Sep 2020 • Minhyuk Sung, Zhenyu Jiang, Panos Achlioptas, Niloy J. Mitra, Leonidas J. Guibas
Shape deformation is an important component in any geometry processing toolbox.
Graphics
no code implementations • 20 Jul 2020 • Haisheng Su, Jinyuan Feng, Hao Shao, Zhenyu Jiang, Manyuan Zhang, Wei Wu, Yu Liu, Hongsheng Li, Junjie Yan
Specifically, in order to generate high-quality proposals, we consider several factors including the video feature encoder, the proposal generator, the proposal-proposal relations, the scale imbalance, and ensemble strategy.
1 code implementation • CVPR 2020 • Cheng Ma, Zhenyu Jiang, Yongming Rao, Jiwen Lu, Jie zhou
In this paper, we propose a deep face super-resolution (FSR) method with iterative collaboration between two recurrent networks which focus on facial image recovery and landmark estimation respectively.