TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Monocular Depth Estimation	KITTI Eigen split	PackNet-SfM	absolute relative error	0.12	# 59
Monocular Depth Estimation	KITTI Eigen split unsupervised	PackNet-SfM M	absolute relative error	0.107	# 28
Monocular Depth Estimation	KITTI Object Tracking Evaluation 2012	PackNet-SfM	Abs Rel	0.071	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/packnet-sfm-3d-packing-for-self-supervised/monocular-depth-estimation-on-kitti-object)](https://paperswithcode.com/sota/monocular-depth-estimation-on-kitti-object?p=packnet-sfm-3d-packing-for-self-supervised)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/packnet-sfm-3d-packing-for-self-supervised/monocular-depth-estimation-on-kitti-eigen-1)](https://paperswithcode.com/sota/monocular-depth-estimation-on-kitti-eigen-1?p=packnet-sfm-3d-packing-for-self-supervised)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/packnet-sfm-3d-packing-for-self-supervised/monocular-depth-estimation-on-kitti-eigen)](https://paperswithcode.com/sota/monocular-depth-estimation-on-kitti-eigen?p=packnet-sfm-3d-packing-for-self-supervised)`

3D Packing for Self-Supervised Monocular Depth Estimation

CVPR 2020 · Vitor Guizilini, Rares Ambrus, Sudeep Pillai, Allan Raventos, Adrien Gaidon ·

Although cameras are ubiquitous, robotic platforms typically rely on active sensors like LiDAR for direct 3D perception. In this work, we propose a novel self-supervised monocular depth estimation method combining geometry with a new deep network, PackNet, learned only from unlabeled monocular videos. Our architecture leverages novel symmetrical packing and unpacking blocks to jointly learn to compress and decompress detail-preserving representations using 3D convolutions. Although self-supervised, our method outperforms other self, semi, and fully supervised methods on the KITTI benchmark. The 3D inductive bias in PackNet enables it to scale with input resolution and number of parameters without overfitting, generalizing better on out-of-domain data such as the NuScenes dataset. Furthermore, it does not require large-scale supervised pretraining on ImageNet and can run in real-time. Finally, we release DDAD (Dense Depth for Automated Driving), a new urban driving dataset with more challenging and accurate depth evaluation, thanks to longer-range and denser ground-truth depth generated from high-density LiDARs mounted on a fleet of self-driving cars operating world-wide.

PDF Abstract CVPR 2020 PDF CVPR 2020 Abstract

Code

Add Remove Mark official

TRI-ML/packnet-sfm official

1,200

TRI-ML/DDAD official

475

ToyotaResearchInstitute/packnet-sfm official

sejong-rcv/2021.Paper.TransDSSL

Tasks

Add Remove

Depth Estimation

Inductive Bias

Monocular Depth Estimation

Self-Driving Cars

Datasets

Introduced in the Paper:

DDAD

Used in the Paper:

ImageNet

Cityscapes

KITTI

nuScenes

Results from the Paper

Edit

Ranked #1 on Monocular Depth Estimation on KITTI Object Tracking Evaluation 2012

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Monocular Depth Estimation	KITTI Eigen split	PackNet-SfM	absolute relative error	0.12	# 59	Compare
Monocular Depth Estimation	KITTI Eigen split unsupervised	PackNet-SfM M	absolute relative error	0.107	# 28	Compare
Monocular Depth Estimation	KITTI Object Tracking Evaluation 2012	PackNet-SfM	Abs Rel	0.071	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

3D Packing for Self-Supervised Monocular Depth Estimation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove