no code implementations • 29 Aug 2024 • Fangfu Liu, Wenqiang Sun, HanYang Wang, Yikai Wang, Haowen Sun, Junliang Ye, Jun Zhang, Yueqi Duan
Advancements in 3D scene reconstruction have transformed 2D images from the real world into 3D models, producing realistic 3D results from hundreds of input photos.
no code implementations • 22 Aug 2024 • Weiliang Chen, Fangfu Liu, Diankun Wu, Haowen Sun, Haixu Song, Yueqi Duan
We are living in a flourishing era of digital media, where everyone has the potential to become a personal filmmaker.
1 code implementation • 2 Jul 2024 • Yilong Lai, Jialong Wu, Congzhi Zhang, Haowen Sun, Deyu Zhou
Conversational Query Reformulation (CQR) has significantly advanced in addressing the challenges of conversational search, particularly those stemming from the latent user intent and the need for historical context.
1 code implementation • 6 May 2024 • Haowen Sun, Ruikun Zheng, Haibin Huang, Chongyang Ma, Hui Huang, Ruizhen Hu
In this paper, we introduce LGTM, a novel Local-to-Global pipeline for Text-to-Motion generation.
1 code implementation • 22 Mar 2024 • Sisi Dai, Wenhao Li, Haowen Sun, Haibin Huang, Chongyang Ma, Hui Huang, Kai Xu, Ruizhen Hu
In this study, we tackle the complex task of generating 3D human-object interactions (HOI) from textual descriptions in a zero-shot text-to-3D manner.
no code implementations • 14 Mar 2024 • Fangfu Liu, HanYang Wang, Weiliang Chen, Haowen Sun, Yueqi Duan
Recent years have witnessed the strong power of 3D generation models, which offer a new level of creative flexibility by allowing users to guide the 3D content generation process through a single image or natural language.
no code implementations • CVPR 2024 • Haowen Sun, Yueqi Duan, Juncheng Yan, Yifan Liu, Jiwen Lu
Nowadays, leveraging 2D images and pre-trained models to guide 3D point cloud feature representation has shown remarkable potential to boost the performance of 3D fundamental models.
1 code implementation • 17 Dec 2021 • An Tao, Yueqi Duan, He Wang, Ziyi Wu, Pengliang Ji, Haowen Sun, Jie Zhou, Jiwen Lu
It results in a serious issue of lagged gradient, making the learned attack at the current step ineffective due to the architecture changes afterward.
no code implementations • 24 Oct 2021 • Haowen Sun, Taiyong Wang
Given an RGBD image, our network is trained to predict pixel category and the translation to edge points and center points.