Search Results for author: Long Zeng

Found 20 papers, 7 papers with code

DSM: Building A Diverse Semantic Map for 3D Visual Grounding

no code implementations11 Apr 2025 Qinghongbing Xie, Zijian Liang, Long Zeng

This method leverages VLMs to capture the latent semantic attributes and relations of objects within the scene and creates a Diverse Semantic Map (DSM) through a geometry sliding-window map construction strategy.

3D visual grounding Scene Understanding +1

SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning

no code implementations1 Apr 2025 Xiaole Xian, Zhichao Liao, Qingyu Li, Wenyu Qin, Pengfei Wan, Weicheng Xie, Long Zeng, Linlin Shen, Pingfa Feng

Fine-tuning a pre-trained Text-to-Image (T2I) model on a tailored portrait dataset is the mainstream method for text-driven customization of portrait attributes.

Contrastive Learning Incremental Learning

NuGrounding: A Multi-View 3D Visual Grounding Framework in Autonomous Driving

no code implementations28 Mar 2025 Fuhao Li, Huan Jin, Bin Gao, Liaoyuan Fan, Lihui Jiang, Long Zeng

Multi-view 3D visual grounding is critical for autonomous driving vehicles to interpret natural languages and localize target objects in complex environments.

3D visual grounding Autonomous Driving +1

Diffusion Suction Grasping with Large-Scale Parcel Dataset

no code implementations11 Feb 2025 Ding-Tao Huang, Xinyi He, Debei Hua, Dongfang Yu, En-Te Lin, Long Zeng

While recent advances in object suction grasping have shown remarkable progress, significant challenges persist particularly in cluttered and complex parcel handling scenarios.

Denoising

DreamFit: Garment-Centric Human Generation via a Lightweight Anything-Dressing Encoder

no code implementations23 Dec 2024 Ente Lin, Xujie Zhang, Fuwei Zhao, Yuxuan Luo, Xin Dong, Long Zeng, Xiaodan Liang

However, existing methods often face a dilemma: lightweight approaches, such as adapters, are prone to generate inconsistent textures; while finetune-based methods involve high training costs and struggle to maintain the generalization capabilities of pretrained diffusion models, limiting their performance across diverse scenarios.

Training-Free Point Cloud Recognition Based on Geometric and Semantic Information Fusion

no code implementations7 Sep 2024 Yan Chen, Di Huang, Zhichao Liao, Xi Cheng, Xinghui Li, Long Zeng

For the geometric branch, we adopt a non-parametric strategy to extract geometric features.

Freehand Sketch Generation from Mechanical Components

1 code implementation12 Aug 2024 Zhichao Liao, Di Huang, Heming Fang, Yue Ma, Fengyuan Piao, Xinghui Li, Long Zeng, Pingfa Feng

To address this issue, we design a two-stage generative framework mimicking the human sketching behavior pattern, called MSFormer, which is the first time to produce humanoid freehand sketches tailored for mechanical components.

Automated Peer Reviewing in Paper SEA: Standardization, Evaluation, and Analysis

1 code implementation9 Jul 2024 Jianxiang Yu, Zichen Ding, Jiaqi Tan, Kangyang Luo, Zhenmin Weng, Chenghua Gong, Long Zeng, Renjing Cui, Chengcheng Han, Qiushi Sun, Zhiyong Wu, Yunshi Lan, Xiang Li

Finally, SEA-A introduces a new evaluation metric called mismatch score to assess the consistency between paper contents and reviews.

SD-Net: Symmetric-Aware Keypoint Prediction and Domain Adaptation for 6D Pose Estimation In Bin-picking Scenarios

1 code implementation14 Mar 2024 Ding-Tao Huang, En-Te Lin, Lipeng Chen, Li-Fu Liu, Long Zeng

Specifically, at the keypoint prediction stage, we designe a robust 3D keypoints selection strategy considering the symmetry class of objects and equivalent keypoints, which facilitate locating 3D keypoints even in highly occluded scenes.

3D geometry 6D Pose Estimation +1

NormNet: Scale Normalization for 6D Pose Estimation in Stacked Scenarios

no code implementations15 Nov 2023 En-Te Lin, Wei-Jie Lv, Ding-Tao Huang, Long Zeng

Existing Object Pose Estimation (OPE) methods for stacked scenarios are not robust to changes in object scale.

6D Pose Estimation Semantic Segmentation +1

Medical Image Segmentation via Sparse Coding Decoder

no code implementations17 Oct 2023 Long Zeng, Kaigui Wu

Transformers have achieved significant success in medical image segmentation, owing to its capability to capture long-range dependencies.

Decoder Image Segmentation +2

PCKRF: Point Cloud Completion and Keypoint Refinement With Fusion Data for 6D Pose Estimation

1 code implementation7 Oct 2022 Yiheng Han, Irvin Haozhe Zhan, Long Zeng, Yu-Ping Wang, Ran Yi, MinJing Yu, Matthieu Gaetan Lin, Jenny Sheng, Yong-Jin Liu

In this paper, we propose Point Cloud Completion and Keypoint Refinement with Fusion Data (PCKRF), a new pose refinement pipeline for 6D pose estimation.

6D Pose Estimation Point Cloud Completion +1

SpiderCNN: Deep Learning on Point Sets with Parameterized Convolutional Filters

1 code implementation ECCV 2018 Yifan Xu, Tianqi Fan, Mingye Xu, Long Zeng, Yu Qiao

Deep neural networks have enjoyed remarkable success for various vision tasks, however it remains challenging to apply CNNs to domains lacking a regular underlying structures such as 3D point clouds.

3D Part Segmentation 3D Point Cloud Classification +1

Cannot find the paper you are looking for? You can Submit a new open access paper.