no code implementations • 3 Apr 2024 • Yisheng He, Weihao Yuan, Siyu Zhu, Zilong Dong, Liefeng Bo, QiXing Huang
This paper enables high-fidelity, transferable NeRF editing by frequency decomposition.
no code implementations • 22 Mar 2024 • Zhengyi Zhao, Chen Song, Xiaodong Gu, Yuan Dong, Qi Zuo, Weihao Yuan, Zilong Dong, Liefeng Bo, QiXing Huang
In particular, the third and fourth stages are iterated, with the cuts obtained in the fourth stage encouraging non-rigid alignment in the third stage to focus on regions close to the cuts.
no code implementations • 19 Mar 2024 • Junhao Cai, Yisheng He, Weihao Yuan, Siyu Zhu, Zilong Dong, Liefeng Bo, Qifeng Chen
Derived from OmniObject3D, OO3D-9D is the largest and most diverse dataset in the field of category-level object pose and size estimation.
no code implementations • 18 Mar 2024 • Qi Zuo, Xiaodong Gu, Lingteng Qiu, Yuan Dong, Zhengyi Zhao, Weihao Yuan, Rui Peng, Siyu Zhu, Zilong Dong, Liefeng Bo, QiXing Huang
Images from video generative models are more suitable for multi-view generation because the underlying network architecture that generates them employs a temporal module to enforce frame consistency.
no code implementations • 25 Jan 2024 • Minglin Chen, Weihao Yuan, Yukun Wang, Zhe Sheng, Yisheng He, Zilong Dong, Liefeng Bo, Yulan Guo
We propose a novel synchronized generation and reconstruction method to effectively optimize the NeRF.
no code implementations • 28 Nov 2023 • Lingteng Qiu, GuanYing Chen, Xiaodong Gu, Qi Zuo, Mutian Xu, Yushuang Wu, Weihao Yuan, Zilong Dong, Liefeng Bo, Xiaoguang Han
Lifting 2D diffusion for 3D generation is a challenging problem due to the lack of geometric prior and the complex entanglement of materials and lighting in natural images.
1 code implementation • 31 Jan 2023 • Weihao Yuan, Xiaodong Gu, Heng Li, Zilong Dong, Siyu Zhu
In this work, we propose an SDF transformer network, which replaces the role of 3D CNN for better 3D feature aggregation.
no code implementations • 21 Jan 2023 • Heng Li, Xiaodong Gu, Weihao Yuan, Luwei Yang, Zilong Dong, Ping Tan
To reach this challenging goal without depth input, we introduce a hierarchical feature volume to facilitate the implicit map decoder.
no code implementations • 14 Jan 2023 • Meng Li, Senbo Wang, Weihao Yuan, Weichao Shen, Zhe Sheng, Zilong Dong
In this paper, we propose an end-to-end deep network for monocular panorama depth estimation on a unit spherical surface.
no code implementations • 23 May 2022 • Xiaodong Gu, Chengzhou Tang, Weihao Yuan, Zuozhuo Dai, Siyu Zhu, Ping Tan
In the experiments, we evaluate the proposed method on both the 3D scene flow estimation and the point cloud registration task.
1 code implementation • CVPR 2022 • Weihao Yuan, Xiaodong Gu, Zuozhuo Dai, Siyu Zhu, Ping Tan
While recent works design increasingly complicated and powerful networks to directly regress the depth map, we take the path of CRFs optimization.
Ranked #1 on Depth Prediction on Matterport3D
1 code implementation • CVPR 2022 • Xiaodong Gu, Chengzhou Tang, Weihao Yuan, Zuozhuo Dai, Siyu Zhu, Ping Tan
In the experiments, we evaluate the proposed method on both the 3D scene flow estimation and the point cloud registration task.
no code implementations • CVPR 2022 • Weihao Yuan, Xiaodong Gu, Zuozhuo Dai, Siyu Zhu, Ping Tan
Estimating the accurate depth from a single image is challenging since it is inherently ambiguous and ill-posed.
no code implementations • 5 Aug 2021 • Weihao Yuan, Rui Fan, Michael Yu Wang, Qifeng Chen
We design a multiscopic vision system that utilizes a low-cost monocular RGB camera to acquire accurate depth estimation.
no code implementations • 9 Apr 2021 • Weihao Yuan, Yazhan Zhang, Bingkun Wu, Siyu Zhu, Ping Tan, Michael Yu Wang, Qifeng Chen
Self-supervised learning for depth estimation possesses several advantages over supervised learning.
1 code implementation • 24 Mar 2021 • Xiaodong Gu, Weihao Yuan, Zuozhuo Dai, Siyu Zhu, Chengzhou Tang, Zilong Dong, Ping Tan
There are increasing interests of studying the video-to-depth (V2D) problem with machine learning techniques.
3 code implementations • 22 Mar 2021 • Zuozhuo Dai, Guangyuan Wang, Weihao Yuan, Xiaoli Liu, Siyu Zhu, Ping Tan
Thus, our method can solve the problem of cluster inconsistency and be applicable to larger data sets.
Ranked #1 on Unsupervised Person Re-Identification on PersonX
1 code implementation • 3 Aug 2020 • Weihao Yuan, Michael Yu Wang, Qifeng Chen
Self-supervised learning for visual object tracking possesses valuable advantages compared to supervised learning, such as the non-necessity of laborious human annotations and online training.
1 code implementation • 22 Jan 2020 • Weihao Yuan, Rui Fan, Michael Yu Wang, Qifeng Chen
We design a multiscopic vision system that utilizes a low-cost monocular RGB camera to acquire accurate depth estimation for robotic applications.
no code implementations • 9 Oct 2019 • Yazhan Zhang, Weihao Yuan, Zicheng Kan, Michael Yu Wang
In essence, successful grasp boils down to correct responses to multiple contact events between fingertips and objects.
no code implementations • 12 Sep 2018 • Weihao Yuan, Kaiyu Hang, Haoran Song, Danica Kragic, Michael Y. Wang, Johannes A. Stork
Moving a human body or a large and bulky object can require the strength of whole arm manipulation (WAM).
no code implementations • 15 Mar 2018 • Weihao Yuan, Johannes A. Stork, Danica Kragic, Michael Y. Wang, Kaiyu Hang
Usually, this is achieved by precisely modeling physical properties of the objects, robot, and the environment for explicit planning.