no code implementations • 4 Dec 2024 • Zehuan Huang, Yuan-Chen Guo, Haoran Wang, Ran Yi, Lizhuang Ma, Yan-Pei Cao, Lu Sheng
To efficiently model the 3D geometric knowledge within the adapter, we introduce innovative designs that include duplicated self-attention layers and parallel attention architecture, enabling the adapter to inherit the powerful priors of the pre-trained models to model the novel 3D knowledge.
no code implementations • 4 Dec 2024 • Zehuan Huang, Yuan-Chen Guo, Xingqiao An, Yunhan Yang, Yangguang Li, Zi-Xin Zou, Ding Liang, Xihui Liu, Yan-Pei Cao, Lu Sheng
This paper introduces MIDI, a novel paradigm for compositional 3D scene generation from a single image.
no code implementations • 26 Nov 2024 • Yijia Hong, Yuan-Chen Guo, Ran Yi, Yulong Chen, Yan-Pei Cao, Lizhuang Ma
We present SuperMat, a single-step framework that achieves high-quality material decomposition with one-step inference.
1 code implementation • 22 Nov 2024 • Xin Yu, Ze Yuan, Yuan-Chen Guo, Ying-Tian Liu, Jianhui Liu, Yangguang Li, Yan-Pei Cao, Ding Liang, Xiaojuan Qi
Instead, we focus on the fundamental problem of learning in the UV texture space itself.
1 code implementation • 11 Nov 2024 • Yunhan Yang, Yukun Huang, Yuan-Chen Guo, Liangjun Lu, Xiaoyang Wu, Edmund Y. Lam, Yan-Pei Cao, Xihui Liu
For flexibility, we distill scale-conditioned part-aware 3D features for 3D part segmentation at multiple granularities.
no code implementations • 16 Oct 2024 • Jingxiang Sun, Cheng Peng, Ruizhi Shao, Yuan-Chen Guo, Xiaochen Zhao, Yangguang Li, YanPei Cao, Bo Zhang, Yebin Liu
We introduce DreamCraft3D++, an extension of DreamCraft3D that enables efficient high-quality generation of complex 3D assets.
no code implementations • 20 Dec 2023 • Yue-Jiang Dong, Yuan-Chen Guo, Ying-Tian Liu, Fang-Lue Zhang, Song-Hai Zhang
Self-supervised monocular depth estimation is of significant importance with applications spanning across autonomous driving and robotics.
no code implementations • CVPR 2024 • Ying-Tian Liu, Yuan-Chen Guo, Guan Luo, Heyi Sun, Wei Yin, Song-Hai Zhang
However, the generation quality and generalization ability of 3D diffusion models is hindered by the scarcity of high-quality and large-scale 3D datasets.
1 code implementation • CVPR 2024 • Zi-Xin Zou, Zhipeng Yu, Yuan-Chen Guo, Yangguang Li, Ding Liang, Yan-Pei Cao, Song-Hai Zhang
Recent advancements in 3D reconstruction from single images have been driven by the evolution of generative models.
1 code implementation • CVPR 2024 • Yunhan Yang, Yukun Huang, Xiaoyang Wu, Yuan-Chen Guo, Song-Hai Zhang, Hengshuang Zhao, Tong He, Xihui Liu
However, due to the lack of information from multiple views, these works encounter difficulties in generating controllable novel views.
no code implementations • 30 Oct 2023 • Xin Yu, Yuan-Chen Guo, Yangguang Li, Ding Liang, Song-Hai Zhang, Xiaojuan Qi
In this paper, we re-evaluate the role of classifier-free guidance in score distillation and discover a surprising finding: the guidance alone is enough for effective text-to-3D generation tasks.
1 code implementation • CVPR 2024 • Xiaoxiao Long, Yuan-Chen Guo, Cheng Lin, YuAn Liu, Zhiyang Dou, Lingjie Liu, Yuexin Ma, Song-Hai Zhang, Marc Habermann, Christian Theobalt, Wenping Wang
In this work, we introduce Wonder3D, a novel method for efficiently generating high-fidelity textured meshes from single-view images. Recent methods based on Score Distillation Sampling (SDS) have shown the potential to recover 3D geometry from 2D diffusion priors, but they typically suffer from time-consuming per-shape optimization and inconsistent geometry.
Ranked #2 on Single-View 3D Reconstruction on GSO
no code implementations • NeurIPS 2023 • Zheng Chen, Yan-Pei Cao, Yuan-Chen Guo, Chen Wang, Ying Shan, Song-Hai Zhang
Unlike generalizable radiance fields trained on perspective images, PanoGRF avoids the information loss from panorama-to-perspective conversion and directly aggregates geometry and appearance features of 3D sample points from each panoramic view based on spherical projection.
1 code implementation • CVPR 2023 • Ying-Tian Liu, Zhifei Zhang, Yuan-Chen Guo, Matthew Fisher, Zhaowen Wang, Song-Hai Zhang
Automatic generation of fonts can be an important aid to typeface design.
no code implementations • 28 Mar 2023 • Yuan-Chen Guo, Yan-Pei Cao, Chen Wang, Yu He, Ying Shan, XiaoHu Qie, Song-Hai Zhang
With the emergence of neural radiance fields (NeRFs), view synthesis quality has reached an unprecedented level.
no code implementations • ICCV 2023 • Tian-Xing Xu, Yuan-Chen Guo, Yu-Kun Lai, Song-Hai Zhang
To address these issues, we present MBPTrack, which adopts a Memory mechanism to utilize past information and formulates localization in a coarse-to-fine scheme using Box Priors given in the first frame.
no code implementations • ICCV 2023 • Chia-Hao Chen, Ying-Tian Liu, Zhifei Zhang, Yuan-Chen Guo, Song-Hai Zhang
Existing vector font generation approaches either struggle to preserve high-frequency corner details of the glyph or produce vector shapes that have redundant segments, which hinders their applications in practical scenarios.
no code implementations • CVPR 2023 • Tian-Xing Xu, Yuan-Chen Guo, Yu-Kun Lai, Song-Hai Zhang
Therefore, contextual information across two consecutive frames is crucial for effective object tracking.
no code implementations • 12 Sep 2022 • Zheng Chen, Chen Wang, Yuan-Chen Guo, Song-Hai Zhang
Neural Radiance Fields (NeRF) achieve photo-realistic view synthesis with densely captured input images.
no code implementations • 21 Jul 2022 • Tian-Xing Xu, Yuan-Chen Guo, Yong-Liang Yang, Song-Hai Zhang
Point clouds captured by depth sensors are often contaminated by noises, obstructing further analysis and applications.
no code implementations • 10 Dec 2021 • Ying-Tian Liu, Yuan-Chen Guo, Song-Hai Zhang
Is the center position fully capable of representing a pixel?
1 code implementation • 3 Dec 2021 • Chen Wang, Xian Wu, Yuan-Chen Guo, Song-Hai Zhang, Yu-Wing Tai, Shi-Min Hu
We present NeRF-SR, a solution for high-resolution (HR) novel view synthesis with mostly low-resolution (LR) inputs.
no code implementations • CVPR 2022 • Yuan-Chen Guo, Di Kang, Linchao Bao, Yu He, Song-Hai Zhang
Specifically, we propose to split a scene into transmitted and reflected components, and model the two components with separate neural radiance fields.
no code implementations • 9 Jul 2021 • Yuan Xue, Yuan-Chen Guo, Han Zhang, Tao Xu, Song-Hai Zhang, Xiaolei Huang
In many applications of computer graphics, art and design, it is desirable for a user to provide intuitive non-image input, such as text, sketch, stroke, graph or layout, and have a computer system automatically generate photo-realistic images that adhere to the input content.
no code implementations • 16 Jun 2021 • Ying-Tian Liu, Yuan-Chen Guo, Yi-Xiao Li, Chen Wang, Song-Hai Zhang
In this paper, we present a novel implicit glyph shape representation, which models glyphs as shape primitives enclosed by quadratic curves, and naturally enables generating glyph images at arbitrary high resolutions.
1 code implementation • 25 May 2021 • Tian-Xing Xu, Yuan-Chen Guo, Zhiqiang Li, Ge Yu, Yu-Kun Lai, Song-Hai Zhang
Place recognition plays an essential role in the field of autonomous driving and robot navigation.
Ranked #4 on 3D Place Recognition on CS-Campus3D
1 code implementation • CVPR 2021 • Song-Hai Zhang, Yuan-Chen Guo, Qing-Wen Gu
We investigate the problem of generating 3D meshes from single free-hand sketches, aiming at fast 3D modeling for novice users.