1 code implementation • 5 Sep 2024 • Yunze Man, Shuhong Zheng, Zhipeng Bao, Martial Hebert, Liang-Yan Gui, Yu-Xiong Wang
To address this issue, we present a comprehensive study that probes various visual encoding models for 3D scene understanding, identifying the strengths and limitations of each model across different scenarios.
Ranked #3 on Question Answering on SQA3D
no code implementations • 10 Dec 2023 • Zhipeng Bao, Yijun Li, Krishna Kumar Singh, Yu-Xiong Wang, Martial Hebert
Despite recent significant strides achieved by diffusion-based Text-to-Image (T2I) models, current systems are still less capable of ensuring decent compositional generation aligned with text prompts, particularly for the multi-object generation.
1 code implementation • ICCV 2023 • Shuhong Zheng, Zhipeng Bao, Martial Hebert, Yu-Xiong Wang
To tackle the MTVS problem, we propose MuvieNeRF, a framework that incorporates both multi-task and cross-view knowledge to simultaneously synthesize multiple scene properties.
2 code implementations • CVPR 2023 • Zhipeng Bao, Pavel Tokmakov, Yu-Xiong Wang, Adrien Gaidon, Martial Hebert
Object discovery -- separating objects from the background without manual labels -- is a fundamental open challenge in computer vision.
no code implementations • 9 Jun 2022 • Mingtong Zhang, Shuhong Zheng, Zhipeng Bao, Martial Hebert, Yu-Xiong Wang
Comprehensive 3D scene understanding, both geometrically and semantically, is important for real-world applications such as robot perception.
1 code implementation • CVPR 2022 • Zhipeng Bao, Pavel Tokmakov, Allan Jabri, Yu-Xiong Wang, Adrien Gaidon, Martial Hebert
Our experiments demonstrate that, despite only capturing a small subset of the objects that move, this signal is enough to generalize to segment both moving and static instances of dynamic objects.
no code implementations • 29 Sep 2021 • Zhipeng Bao, Yu-Xiong Wang, Martial Hebert
Generative modeling has recently shown great promise in computer vision, but it has mostly focused on synthesizing visually realistic images.
no code implementations • 25 Jun 2021 • Zhipeng Bao, Martial Hebert, Yu-Xiong Wang
Generative modeling has recently shown great promise in computer vision, but it has mostly focused on synthesizing visually realistic images.
1 code implementation • ICLR 2021 • Zhipeng Bao, Yu-Xiong Wang, Martial Hebert
We propose a novel task of joint few-shot recognition and novel-view synthesis: given only one or few images of a novel object from arbitrary views with only category annotation, we aim to simultaneously learn an object classifier and generate images of that type of object from new viewpoints.
no code implementations • 27 Aug 2019 • Ziqian Luo, Xiangrui Zeng, Zhipeng Bao, Min Xu
Deep learning model trained by imbalanced data may not work satisfactorily since it could be determined by major classes and thus may ignore the classes with small amount of data.