Search Results for author: Runyu Ding

Found 8 papers, 6 papers with code

Can 3D Vision-Language Models Truly Understand Natural Language?

1 code implementation21 Mar 2024 Weipeng Deng, Runyu Ding, Jihan Yang, Jiahui Liu, Yijiang Li, Xiaojuan Qi, Edith Ngai

To test the language understandability of 3D-VL models, we first propose a language robustness task for systematically assessing 3D-VL models across various tasks, benchmarking their performance when presented with different language style variants.

Benchmarking

V-IRL: Grounding Virtual Intelligence in Real Life

1 code implementation5 Feb 2024 Jihan Yang, Runyu Ding, Ellis Brown, Xiaojuan Qi, Saining Xie

There is a sensory gulf between the Earth that humans inhabit and the digital realms in which modern AI agents are created.

Decision Making

Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding

no code implementations1 Aug 2023 Runyu Ding, Jihan Yang, Chuhui Xue, Wenqing Zhang, Song Bai, Xiaojuan Qi

To address this challenge, we propose to harness pre-trained vision-language (VL) foundation models that encode extensive knowledge from image-text pairs to generate captions for multi-view images of 3D scenes.

3D Open-Vocabulary Instance Segmentation Instance Segmentation +4

RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding

no code implementations3 Apr 2023 Jihan Yang, Runyu Ding, Zhe Wang, Xiaojuan Qi

Existing 3D scene understanding tasks have achieved high performance on close-set benchmarks but fail to handle novel categories in real-world applications.

Contrastive Learning Instance Segmentation +2

Towards Efficient 3D Object Detection with Knowledge Distillation

1 code implementation30 May 2022 Jihan Yang, Shaoshuai Shi, Runyu Ding, Zhe Wang, Xiaojuan Qi

Then, we build a benchmark to assess existing KD methods developed in the 2D domain for 3D object detection upon six well-constructed teacher-student pairs.

3D Object Detection Knowledge Distillation +3

PAConv: Position Adaptive Convolution with Dynamic Kernel Assembling on Point Clouds

2 code implementations CVPR 2021 Mutian Xu, Runyu Ding, Hengshuang Zhao, Xiaojuan Qi

The key of PAConv is to construct the convolution kernel by dynamically assembling basic weight matrices stored in Weight Bank, where the coefficients of these weight matrices are self-adaptively learned from point positions through ScoreNet.

3D Point Cloud Classification Point Cloud Classification +2

Cannot find the paper you are looking for? You can Submit a new open access paper.