Search Results for author: Jihan Yang

Found 14 papers, 10 papers with code

Can 3D Vision-Language Models Truly Understand Natural Language?

1 code implementation • 21 Mar 2024 • Weipeng Deng, Runyu Ding, Jihan Yang, Jiahui Liu, Yijiang Li, Xiaojuan Qi, Edith Ngai

To test the language understandability of 3D-VL models, we first propose a language robustness task for systematically assessing 3D-VL models across various tasks, benchmarking their performance when presented with different language style variants.

Benchmarking

Paper
Code

V-IRL: Grounding Virtual Intelligence in Real Life

1 code implementation • 5 Feb 2024 • Jihan Yang, Runyu Ding, Ellis Brown, Xiaojuan Qi, Saining Xie

There is a sensory gulf between the Earth that humans inhabit and the digital realms in which modern AI agents are created.

Decision Making

245

Paper
Code

Lowis3D: Language-Driven Open-World Instance-Level 3D Scene Understanding

no code implementations • 1 Aug 2023 • Runyu Ding, Jihan Yang, Chuhui Xue, Wenqing Zhang, Song Bai, Xiaojuan Qi

To address this challenge, we propose to harness pre-trained vision-language (VL) foundation models that encode extensive knowledge from image-text pairs to generate captions for multi-view images of 3D scenes.

Ranked #3 on 3D Open-Vocabulary Instance Segmentation on S3DIS

3D Open-Vocabulary Instance Segmentation Instance Segmentation +4

Paper
Add Code

RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding

no code implementations • 3 Apr 2023 • Jihan Yang, Runyu Ding, Zhe Wang, Xiaojuan Qi

Existing 3D scene understanding tasks have achieved high performance on close-set benchmarks but fail to handle novel categories in real-world applications.

Contrastive Learning Instance Segmentation +2

Paper
Add Code

PLA: Language-Driven Open-Vocabulary 3D Scene Understanding

1 code implementation • CVPR 2023 • Runyu Ding, Jihan Yang, Chuhui Xue, Wenqing Zhang, Song Bai, Xiaojuan Qi

Open-vocabulary scene understanding aims to localize and recognize unseen categories beyond the annotated label space.

Ranked #2 on 3D Open-Vocabulary Instance Segmentation on S3DIS

3D Open-Vocabulary Instance Segmentation Contrastive Learning +4

201

Paper
Code

Towards Efficient 3D Object Detection with Knowledge Distillation

1 code implementation • 30 May 2022 • Jihan Yang, Shaoshuai Shi, Runyu Ding, Zhe Wang, Xiaojuan Qi

Then, we build a benchmark to assess existing KD methods developed in the 2D domain for 3D object detection upon six well-constructed teacher-student pairs.

3D Object Detection Knowledge Distillation +3

106

Paper
Code

DODA: Data-oriented Sim-to-Real Domain Adaptation for 3D Semantic Segmentation

1 code implementation • 4 Apr 2022 • Runyu Ding, Jihan Yang, Li Jiang, Xiaojuan Qi

Deep learning approaches achieve prominent success in 3D semantic segmentation.

3D Semantic Segmentation Segmentation +1

Paper
Code

Knowledge Distillation as Efficient Pre-training: Faster Convergence, Higher Data-efficiency, and Better Transferability

1 code implementation • CVPR 2022 • Ruifei He, Shuyang Sun, Jihan Yang, Song Bai, Xiaojuan Qi

Large-scale pre-training has been proven to be crucial for various computer vision tasks.

Knowledge Distillation

Paper
Code

ST3D++: Denoised Self-training for Unsupervised Domain Adaptation on 3D Object Detection

no code implementations • 15 Aug 2021 • Jihan Yang, Shaoshuai Shi, Zhe Wang, Hongsheng Li, Xiaojuan Qi

These specific designs enable the detector to be trained on meticulously refined pseudo labeled target data with denoised training signals, and thus effectively facilitate adapting an object detector to a target domain without requiring annotations.

3D Object Detection Data Augmentation +5

Paper
Add Code

Re-distributing Biased Pseudo Labels for Semi-supervised Semantic Segmentation: A Baseline Investigation

1 code implementation • ICCV 2021 • Ruifei He, Jihan Yang, Xiaojuan Qi

In this paper, we present a simple and yet effective Distribution Alignment and Random Sampling (DARS) method to produce unbiased pseudo labels that match the true class distribution estimated from the labeled data.

Data Augmentation Segmentation +1

Paper
Code

ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection

1 code implementation • CVPR 2021 • Jihan Yang, Shaoshuai Shi, Zhe Wang, Hongsheng Li, Xiaojuan Qi

Then, the detector is iteratively improved on the target domain by alternatively conducting two steps, which are the pseudo label updating with the developed quality-aware triplet memory bank and the model training with curriculum data augmentation.

3D Object Detection Data Augmentation +4

282

Paper
Code

PV-RCNN: The Top-Performing LiDAR-only Solutions for 3D Detection / 3D Tracking / Domain Adaptation of Waymo Open Dataset Challenges

1 code implementation • 28 Aug 2020 • Shaoshuai Shi, Chaoxu Guo, Jihan Yang, Hongsheng Li

In this technical report, we present the top-performing LiDAR-only solutions for 3D detection, 3D tracking and domain adaptation three tracks in Waymo Open Dataset Challenges 2020.

3D Object Detection Domain Adaptation +1

4,308

Paper
Code

An Adversarial Perturbation Oriented Domain Adaptation Approach for Semantic Segmentation

no code implementations • 18 Dec 2019 • Jihan Yang, Ruijia Xu, Ruiyu Li, Xiaojuan Qi, Xiaoyong Shen, Guanbin Li, Liang Lin

In contrast to adversarial alignment, we propose to explicitly train a domain-invariant classifier by generating and defensing against pointwise feature space adversarial perturbations.

Position Segmentation +2

Paper
Add Code

Larger Norm More Transferable: An Adaptive Feature Norm Approach for Unsupervised Domain Adaptation

3 code implementations • ICCV 2019 • Ruijia Xu, Guanbin Li, Jihan Yang, Liang Lin

Domain adaptation enables the learner to safely generalize into novel environments by mitigating domain shifts across distributions.

Ranked #7 on Domain Adaptation on ImageCLEF-DA

Partial Domain Adaptation Transfer Learning +1

3,134

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.