3D Object Detection

583 papers with code • 55 benchmarks • 48 datasets

3D Object Detection is a task in computer vision where the goal is to identify and locate objects in a 3D environment based on their shape, location, and orientation. It involves detecting the presence of objects and determining their location in the 3D space in real-time. This task is crucial for applications such as autonomous vehicles, robotics, and augmented reality.

( Image credit: AVOD )

Benchmarks

Add a Result

These leaderboards are used to track progress in 3D Object Detection

Dataset	Best Model	Compare
KITTI Cars Moderate	GLENet-VR	See all
nuScenes	EA-LSS	See all
SUN-RGBD val	Point-GCC+TR3D+FF	See all
ScanNetV2	UDeerLvic	See all
KITTI Cars Easy	GLENet-VR	See all
KITTI Cars Hard	3D Dual-Fusion	See all
nuScenes Camera Only	Far3D	See all
KITTI Pedestrians Moderate	3D-FCT	See all
KITTI Cyclists Easy	3D-FCT	See all
KITTI Cyclists Moderate	3D-FCT	See all
KITTI Cyclists Hard	SA-Det3D	See all
KITTI Cars Easy val	SA-SSD+EBM	See all
KITTI Cars Moderate val	SA-SSD+EBM	See all
KITTI Cars Hard val	M3DeTR	See all
KITTI Pedestrians Easy	IPOD	See all
KITTI Pedestrians Hard	SVGA-Net	See all
DAIR-V2X-I	MonoUNI	See all
nuscenes Camera-Radar	HyDRa	See all
waymo vehicle	PillarNeXt	See all
Rope3D	MonoUNI	See all
SUN-RGBD	CAGroup3D (Geo Only)	See all
waymo cyclist	DSVT(val)	See all
waymo pedestrian	DSVT(val)	See all
S3DIS	Point-GCC+TR3D	See all
V2XSet	V2X-ViT	See all
nuScenes LiDAR only	DSVT	See all
OPV2V	V2VNet (PointPillar backbone)	See all
KITTI Pedestrian Easy val	PVCNN	See all
KITTI Pedestrian Moderate val	PVCNN	See all
KITTI Pedestrian Hard val	PVCNN	See all
KITTI Cyclist Easy val	PVCNN	See all
KITTI Cyclist Moderate val	PVCNN	See all
KITTI Cyclist Hard val	F-PointNet++ [Qi:2018fd]	See all
aiMotive Dataset	Lidar-Radar-Camera	See all
3D Object Detection on Argoverse2 Camera Only	Far3D	See all
waymo all_ns	CenterPoint	See all
NYU Depth v2	SGPN-CNN	See all
nuScenes-F	RRPN + R101 - F	See all
nuScenes-FB	RRPN + R101 - FB	See all
KITTI Pedestrian Hard	PiFeNet	See all
KITTI Cyclists Moderate val	Deformable PV-RCNN	See all
KITTI Pedestrians Moderate val	Deformable PV-RCNN	See all
Dense Fog	PV-RCNN	See all
KITTI Pedestrian Moderate	PiFeNet	See all
Heavy Snowfall	PV-RCNN	See all
Light Snowfall	PV-RCNN	See all
Clear Weather	PV-RCNN	See all
KITTI Pedestrian Easy	PiFeNet	See all
KITTI Pedestrian	PiFeNet	See all
V2X-SIM	Where2comm	See all
DAIR-V2X	CoBEVFlow	See all
Argoverse2	VoxelNeXt	See all
Cityscapes 3D	TaskPrompter	See all
Argoverse	ky_nctu_mo	See all
IRV2V	CoBEVFlow	See all

Show all 55 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find 3D Object Detection models and implementations

open-mmlab/mmdetection3d

14 papers

4,778

PaddlePaddle/Paddle3D

6 papers

533

open-mmlab/OpenPCDet

5 papers

4,297

DerrickXuNu/OpenCOOD

5 papers

592

See all 11 libraries.

Datasets

Subtasks

Robust 3D Object Detection

Robust BEV Detection

Most implemented papers

Most implemented Social Latest No code

Multimodal Token Fusion for Vision Transformers

huawei-noah/noah-research • • journal 2022

Many adaptations of transformers have emerged to address the single-modal vision tasks, where self-attention modules are stacked to handle input sources like images.

Paper
Code

Complex-YOLO: Real-time 3D Object Detection on Point Clouds

maudzung/Complex-YOLOv4-Pytorch • • 16 Mar 2018

We introduce Complex-YOLO, a state of the art real-time 3D object detection network on point clouds only.

Paper
Code

Exploring Data Augmentation for Multi-Modality 3D Object Detection

open-mmlab/mmdetection3d • • 23 Dec 2020

Due to the fact that multi-modality data augmentation must maintain consistency between point cloud and images, recent methods in this field typically use relatively insufficient data augmentation.

Paper
Code

FCOS3D: Fully Convolutional One-Stage Monocular 3D Object Detection

open-mmlab/mmdetection3d • • 22 Apr 2021

In this paper, we study this problem with a practice built on a fully convolutional single-stage detector and propose a general framework FCOS3D.

Paper
Code

PolyLoss: A Polynomial Expansion Perspective of Classification Loss Functions

frgfm/Holocron • • ICLR 2022

Cross-entropy loss and focal loss are the most common choices when training deep neural networks for classification problems.

Paper
Code

From Points to Parts: 3D Object Detection from Point Cloud with Part-aware and Part-aggregation Network

sshaoshuai/PointCloudDet3D • • 8 Jul 2019

3D object detection from LiDAR point cloud is a challenging problem in 3D scene understanding and has many practical applications.

Paper
Code

AFDet: Anchor Free One Stage 3D Object Detection

CarkusL/CenterPoint • • 23 Jun 2020

High-efficiency point cloud 3D object detection operated on embedded systems is important for many robotics applications including autonomous driving.

Paper
Code

Searching Efficient 3D Architectures with Sparse Point-Voxel Convolution

mit-han-lab/spvnas • • ECCV 2020

Self-driving cars need to understand 3D scenes efficiently and accurately in order to drive safely.

Paper
Code

CubeSLAM: Monocular 3D Object SLAM

shichaoy/cube_slam • 1 Jun 2018

Objects can provide long-range geometric and scale constraints to improve camera pose estimation and reduce monocular drift.

Paper
Code

Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection

open-mmlab/OpenPCDet • • 31 Dec 2020

In this paper, we take a slightly different viewpoint -- we find that precise positioning of raw points is not essential for high performance 3D object detection and that the coarse voxel granularity can also offer sufficient detection accuracy.

Paper
Code

3D Object Detection

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result