Search Results for author: Zetong Yang

Found 10 papers, 4 papers with code

3DSSD: Point-based 3D Single Stage Object Detector

2 code implementations CVPR 2020 Zetong Yang, Yanan sun, Shu Liu, Jiaya Jia

Our method outperforms all state-of-the-art voxel-based single stage methods by a large margin, and has comparable performance to two stage point-based methods as well, with inference speed more than 25 FPS, 2x faster than former state-of-the-art point-based methods.

Object

3D-MAN: 3D Multi-frame Attention Network for Object Detection

no code implementations CVPR 2021 Zetong Yang, Yin Zhou, Zhifeng Chen, Jiquan Ngiam

In this paper, we present 3D-MAN: a 3D multi-frame attention network that effectively aggregates features from multiple perspectives and achieves state-of-the-art performance on Waymo Open Dataset.

3D Object Detection Autonomous Driving +1

A Unified Query-based Paradigm for Point Cloud Understanding

1 code implementation CVPR 2022 Zetong Yang, Li Jiang, Yanan sun, Bernt Schiele, Jiaya Jia

This is achieved by introducing an intermediate representation, i. e., Q-representation, in the querying stage to serve as a bridge between the embedding stage and task heads.

Autonomous Driving object-detection +2

Visual Point Cloud Forecasting enables Scalable Autonomous Driving

1 code implementation29 Dec 2023 Zetong Yang, Li Chen, Yanan sun, Hongyang Li

To resolve this, we bring up a new pre-training task termed as visual point cloud forecasting - predicting future point clouds from historical visual input.

Motion Forecasting

Improving Distant 3D Object Detection Using 2D Box Supervision

no code implementations14 Mar 2024 Zetong Yang, Zhiding Yu, Chris Choy, Renhao Wang, Anima Anandkumar, Jose M. Alvarez

This mapping allows the depth estimation of distant objects conditioned on their 2D boxes, making long-range 3D detection with 2D supervision feasible.

3D Object Detection Depth Estimation +2

CN: Channel Normalization For Point Cloud Recognition

no code implementations ECCV 2020 Zetong Yang, Yanan sun, Shu Liu, Xiaojuan Qi, Jiaya Jia

In 3D recognition, to fuse multi-scale structure information, existing methods apply hierarchical frameworks stacked by multiple fusion layers for integrating current relative locations with structure information from the previous level.

Cannot find the paper you are looking for? You can Submit a new open access paper.