Search Results for author: Yanan Zhang

Found 10 papers, 3 papers with code

SA-BEV: Generating Semantic-Aware Bird's-Eye-View Feature for Multi-view 3D Object Detection

1 code implementation21 Jul 2023 Jinqing Zhang, Yanan Zhang, Qingjie Liu, Yunhong Wang

In this paper, we propose Semantic-Aware BEV Pooling (SA-BEVPool), which can filter out background information according to the semantic segmentation of image features and transform image features into semantic-aware BEV features.

3D Object Detection

OcTr: Octree-based Transformer for 3D Object Detection

no code implementations CVPR 2023 Chao Zhou, Yanan Zhang, Jiaxin Chen, Di Huang

A key challenge for LiDAR-based 3D object detection is to capture sufficient features from large scale 3D scenes especially for distant or/and occluded objects.

3D Object Detection object-detection

MetaMask: Revisiting Dimensional Confounder for Self-Supervised Learning

2 code implementations16 Sep 2022 Jiangmeng Li, Wenwen Qiang, Yanan Zhang, Wenyi Mo, Changwen Zheng, Bing Su, Hui Xiong

As a successful approach to self-supervised learning, contrastive learning aims to learn invariant information shared among distortions of the input sample.

Contrastive Learning Meta-Learning +1

Disentangle and Remerge: Interventional Knowledge Distillation for Few-Shot Object Detection from A Conditional Causal Perspective

1 code implementation26 Aug 2022 Jiangmeng Li, Yanan Zhang, Wenwen Qiang, Lingyu Si, Chengbo Jiao, Xiaohui Hu, Changwen Zheng, Fuchun Sun

To understand the reasons behind this phenomenon, we revisit the learning paradigm of knowledge distillation on the few-shot object detection task from the causal theoretic standpoint, and accordingly, develop a Structural Causal Model.

Few-Shot Learning Few-Shot Object Detection +3

CAT-Det: Contrastively Augmented Transformer for Multi-modal 3D Object Detection

no code implementations CVPR 2022 Yanan Zhang, Jiaxin Chen, Di Huang

In autonomous driving, LiDAR point-clouds and RGB images are two major data modalities with complementary cues for 3D object detection.

3D Object Detection Autonomous Driving +3

Multi-scale fusion self attention mechanism

no code implementations29 Sep 2021 Qibin Li, Nianmin Yao, Jian Zhao, Yanan Zhang

Based on the traditional attention mechanism, multi-scale fusion self attention extracts phrase information at different scales by setting convolution kernels at different levels, and calculates the corresponding attention matrix at different scales, so that the model can better extract phrase level information.

Relation Extraction

Cross Modification Attention Based Deliberation Model for Image Captioning

no code implementations17 Sep 2021 Zheng Lian, Yanan Zhang, Haichang Li, Rui Wang, Xiaohui Hu

The conventional encoder-decoder framework for image captioning generally adopts a single-pass decoding process, which predicts the target descriptive sentence word by word in temporal order.

Descriptive Image Captioning

PC-RGNN: Point Cloud Completion and Graph Neural Network for 3D Object Detection

no code implementations18 Dec 2020 Yanan Zhang, Di Huang, Yunhong Wang

LiDAR-based 3D object detection is an important task for autonomous driving and current approaches suffer from sparse and partial point clouds of distant and occluded objects.

3D Object Detection Autonomous Driving +2

Cannot find the paper you are looking for? You can Submit a new open access paper.