Search Results for author: Xiangyang Zhu

Found 9 papers, 7 papers with code

No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation

4 code implementations • 5 Apr 2024 • Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Han Xiao, Chaoyou Fu, Hao Dong, Peng Gao

To reduce the reliance on large-scale datasets, recent works in 3D segmentation resort to few-shot learning.

Few-Shot Learning Scene Segmentation +1

438

Paper
Code

Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following

5 code implementations • 1 Sep 2023 • Ziyu Guo, Renrui Zhang, Xiangyang Zhu, Yiwen Tang, Xianzheng Ma, Jiaming Han, Kexin Chen, Peng Gao, Xianzhi Li, Hongsheng Li, Pheng-Ann Heng

We introduce Point-Bind, a 3D multi-modality model aligning point clouds with 2D image, language, audio, and video.

Ranked #5 on 3D Question Answering (3D-QA) on 3D MM-Vet

3D Generation 3D Question Answering (3D-QA) +4

381

Paper
Code

Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks

1 code implementation • 24 Aug 2023 • Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Jiaming Liu, Hao Dong, Peng Gao

However, the prior pre-training stage not only introduces excessive time overhead, but also incurs a significant domain gap on `unseen' classes.

3D Semantic Segmentation Few-shot 3D semantic segmentation +1

Paper
Code

Efficient Multi-View Inverse Rendering Using a Hybrid Differentiable Rendering Method

no code implementations • 19 Aug 2023 • Xiangyang Zhu, Yiling Pan, Bailin Deng, Bin Wang

In this paper, we introduce a novel hybrid differentiable rendering method to efficiently reconstruct the 3D geometry and reflectance of a scene from multi-view images captured by conventional hand-held cameras.

Inverse Rendering

Paper
Add Code

Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement

1 code implementation • ICCV 2023 • Xiangyang Zhu, Renrui Zhang, Bowei He, Aojun Zhou, Dong Wang, Bin Zhao, Peng Gao

The popularity of Contrastive Language-Image Pre-training (CLIP) has propelled its application to diverse downstream vision tasks.

Computational Efficiency Few-Shot Learning

120

Paper
Code

PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning

2 code implementations • ICCV 2023 • Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Ziyao Zeng, Zipeng Qin, Shanghang Zhang, Peng Gao

In this paper, we first collaborate CLIP and GPT to be a unified 3D open-world learner, named as PointCLIP V2, which fully unleashes their potential for zero-shot 3D classification, segmentation, and detection.

Ranked #2 on 3D Open-Vocabulary Instance Segmentation on STPLS3D

3D Classification 3D Object Detection +11

291

Paper
Code

Conditional Generative Models for Simulation of EMG During Naturalistic Movements

1 code implementation • 3 Nov 2022 • Shihan Ma, Alexander Kenneth Clarke, Kostiantyn Maksymenko, Samuel Deslauriers-Gauthier, Xinjun Sheng, Xiangyang Zhu, Dario Farina

As a solution to this problem, we propose a transfer learning approach, in which a conditional generative model is trained to mimic the output of an advanced numerical model.

Data Augmentation Transfer Learning

Paper
Code

Steerable Pyramid Transform Enables Robust Left Ventricle Quantification

1 code implementation • 20 Jan 2022 • Xiangyang Zhu, Kede Ma, Wufeng Xue

First, the basis functions of SPT match the anatomical structure of the LV as well as the geometric characteristics of the estimated indices.

Paper
Code

Learning to Navigate from Simulation via Spatial and Semantic Information Synthesis with Noise Model Embedding

no code implementations • 13 Oct 2019 • Gang Chen, Hongzhe Yu, Wei Dong, Xinjun Sheng, Xiangyang Zhu, Han Ding

While training an end-to-end navigation network in the real world is usually of high cost, simulation provides a safe and cheap environment in this training stage.

Navigate

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.