no code implementations • 11 Apr 2025 • Qinghongbing Xie, Zijian Liang, Long Zeng
This method leverages VLMs to capture the latent semantic attributes and relations of objects within the scene and creates a Diverse Semantic Map (DSM) through a geometry sliding-window map construction strategy.
no code implementations • 1 Apr 2025 • Xiaole Xian, Zhichao Liao, Qingyu Li, Wenyu Qin, Pengfei Wan, Weicheng Xie, Long Zeng, Linlin Shen, Pingfa Feng
Fine-tuning a pre-trained Text-to-Image (T2I) model on a tailored portrait dataset is the mainstream method for text-driven customization of portrait attributes.
no code implementations • 31 Mar 2025 • Zhichao Liao, Xiaokun Liu, Wenyu Qin, Qingyu Li, Qiulin Wang, Pengfei Wan, Di Zhang, Long Zeng, Pingfa Feng
Image Aesthetic Assessment (IAA) is a long-standing and challenging research task.
no code implementations • 28 Mar 2025 • Fuhao Li, Huan Jin, Bin Gao, Liaoyuan Fan, Lihui Jiang, Long Zeng
Multi-view 3D visual grounding is critical for autonomous driving vehicles to interpret natural languages and localize target objects in complex environments.
no code implementations • 11 Feb 2025 • Ding-Tao Huang, Xinyi He, Debei Hua, Dongfang Yu, En-Te Lin, Long Zeng
While recent advances in object suction grasping have shown remarkable progress, significant challenges persist particularly in cluttered and complex parcel handling scenarios.
no code implementations • 23 Dec 2024 • Ente Lin, Xujie Zhang, Fuwei Zhao, Yuxuan Luo, Xin Dong, Long Zeng, Xiaodan Liang
However, existing methods often face a dilemma: lightweight approaches, such as adapters, are prone to generate inconsistent textures; while finetune-based methods involve high training costs and struggle to maintain the generalization capabilities of pretrained diffusion models, limiting their performance across diverse scenarios.
no code implementations • 4 Dec 2024 • Wanting Zhang, Haodong Xiang, Zhichao Liao, Xiansong Lai, Xinghui Li, Long Zeng
In this paper, we introduce 2DGS-Room, a novel method leveraging 2D Gaussian Splatting for high-fidelity indoor scene reconstruction.
no code implementations • 12 Nov 2024 • Xi Cheng, Ruiqi Lei, Di Huang, Zhichao Liao, Fengyuan Piao, Yan Chen, Pingfa Feng, Long Zeng
Stage 1 extracts constraints from B-Rep data or point cloud.
no code implementations • 7 Sep 2024 • Yan Chen, Di Huang, Zhichao Liao, Xi Cheng, Xinghui Li, Long Zeng
For the geometric branch, we adopt a non-parametric strategy to extract geometric features.
1 code implementation • 12 Aug 2024 • Zhichao Liao, Di Huang, Heming Fang, Yue Ma, Fengyuan Piao, Xinghui Li, Long Zeng, Pingfa Feng
To address this issue, we design a two-stage generative framework mimicking the human sketching behavior pattern, called MSFormer, which is the first time to produce humanoid freehand sketches tailored for mechanical components.
1 code implementation • 9 Jul 2024 • Jianxiang Yu, Zichen Ding, Jiaqi Tan, Kangyang Luo, Zhenmin Weng, Chenghua Gong, Long Zeng, Renjing Cui, Chengcheng Han, Qiushi Sun, Zhiyong Wu, Yunshi Lan, Xiang Li
Finally, SEA-A introduces a new evaluation metric called mismatch score to assess the consistency between paper contents and reviews.
no code implementations • 30 May 2024 • Haodong Xiang, Xinghui Li, Kai Cheng, Xiansong Lai, Wanting Zhang, Zhichao Liao, Long Zeng, Xueping Liu
Embodied intelligence requires precise reconstruction and rendering to simulate large-scale real-world data.
1 code implementation • 14 Mar 2024 • Ding-Tao Huang, En-Te Lin, Lipeng Chen, Li-Fu Liu, Long Zeng
Specifically, at the keypoint prediction stage, we designe a robust 3D keypoints selection strategy considering the symmetry class of objects and equivalent keypoints, which facilitate locating 3D keypoints even in highly occluded scenes.
no code implementations • 15 Nov 2023 • En-Te Lin, Wei-Jie Lv, Ding-Tao Huang, Long Zeng
Existing Object Pose Estimation (OPE) methods for stacked scenarios are not robust to changes in object scale.
no code implementations • 17 Oct 2023 • Long Zeng, Kaigui Wu
Transformers have achieved significant success in medical image segmentation, owing to its capability to capture long-range dependencies.
no code implementations • 9 Nov 2022 • Liang Zhao, Xinyuan Zhao, Hailong Ma, Xinyu Zhang, Long Zeng
We then fill the hole in the target image with the contents of the aligned image.
1 code implementation • 7 Oct 2022 • Yiheng Han, Irvin Haozhe Zhan, Long Zeng, Yu-Ping Wang, Ran Yi, MinJing Yu, Matthieu Gaetan Lin, Jenny Sheng, Yong-Jin Liu
In this paper, we propose Point Cloud Completion and Keypoint Refinement with Fusion Data (PCKRF), a new pose refinement pipeline for 6D pose estimation.
1 code implementation • CVPR 2022 • Fan Yang, Kai Wu, Shuyi Zhang, Guannan Jiang, Yong liu, Feng Zheng, Wei zhang, Chengjie Wang, Long Zeng
Pseudo-label-based semi-supervised learning (SSL) has achieved great success on raw data utilization.
1 code implementation • IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2019 • Zhikai Dong, Sicheng Liu, Tao Zhou, Hui Cheng, Long Zeng, Xingyao Yu, Houde Liu
For each point in the point cloud, the network regresses a 6D pose of the object instance that the point belongs to.
1 code implementation • ECCV 2018 • Yifan Xu, Tianqi Fan, Mingye Xu, Long Zeng, Yu Qiao
Deep neural networks have enjoyed remarkable success for various vision tasks, however it remains challenging to apply CNNs to domains lacking a regular underlying structures such as 3D point clouds.
Ranked #6 on
3D Part Segmentation
on IntrA