Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-View Transformation

HD map reconstruction is crucial for autonomous driving. LiDAR-based methods are limited by expensive sensors and time-consuming computation. Camera-based methods usually need to perform road segmentation and view transformation separately, which often causes distortion and missing content. To push the limits of the technology, we present a novel framework that reconstructs a local map, formed by the road layout and vehicle occupancy, in the bird's-eye view given only a front-view monocular image. In particular, we propose a cross-view transformation module that takes the constraint of cycle consistency between views into account and makes full use of their correlation to strengthen both view transformation and scene understanding. Considering the relationship between vehicles and roads, we also design a context-aware discriminator that further refines the results. Experiments on public benchmarks show that our method achieves state-of-the-art performance on road layout estimation and vehicle occupancy estimation; on the latter task, our model outperforms all competitors by a large margin. Furthermore, our model runs at 35 FPS on a single GPU, making it efficient and applicable to real-time panorama HD map reconstruction.
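The cycle-consistency constraint mentioned above can be illustrated with a minimal sketch: features are projected from the front view to the bird's-eye view and back, and the round-trip reconstruction error serves as the consistency loss. The projection matrices, feature dimensions, and function names below are hypothetical placeholders, not the paper's actual architecture.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical flattened feature sizes for the two views.
FRONT_DIM, TOP_DIM = 64, 64

# W projects front-view features to the top view; W_back projects them back.
# In the real model these would be learned network layers, not fixed matrices.
W = rng.normal(scale=0.1, size=(TOP_DIM, FRONT_DIM))
W_back = rng.normal(scale=0.1, size=(FRONT_DIM, TOP_DIM))

def cycle_consistency_loss(front_feat):
    """Mean squared error between front-view features and their
    round-trip front -> top -> front reconstruction (the cycle constraint)."""
    top_feat = W @ front_feat       # front view -> bird's-eye view
    front_rec = W_back @ top_feat   # bird's-eye view -> front view
    return float(np.mean((front_feat - front_rec) ** 2))

x = rng.normal(size=FRONT_DIM)
loss = cycle_consistency_loss(x)
```

Minimizing this loss encourages the two view transformations to be (approximate) inverses, so information is preserved across the view change.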



| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| Monocular Cross-View Road Scene Parsing (Road) | Argoverse | crossView | mIoU | 76.56% | # 1 |
| Monocular Cross-View Road Scene Parsing (Vehicle) | Argoverse | crossView | mIoU | 47.87% | # 1 |
| Monocular Cross-View Road Scene Parsing (Vehicle) | KITTI2012 | crossView | mIoU | 38.85% | # 1 |
| Monocular Cross-View Road Scene Parsing (Road) | KITTI Odometry | crossView | mIoU | 77.47% | # 1 |
| Monocular Cross-View Road Scene Parsing (Road) | KITTI Raw | crossView | mIoU | 68.26% | # 1 |
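The mIoU metric reported above averages the per-class intersection-over-union between predicted and ground-truth label maps. A minimal sketch of the computation (the `mean_iou` helper and the toy 2x2 maps are illustrative, not from the paper's evaluation code):

```python
import numpy as np

def mean_iou(pred, target, num_classes):
    """Mean intersection-over-union across classes, from integer label maps."""
    ious = []
    for c in range(num_classes):
        inter = np.logical_and(pred == c, target == c).sum()
        union = np.logical_or(pred == c, target == c).sum()
        if union > 0:  # skip classes absent from both maps
            ious.append(inter / union)
    return float(np.mean(ious))

pred   = np.array([[0, 1], [1, 1]])
target = np.array([[0, 1], [0, 1]])
# class 0: inter 1 / union 2 = 0.5; class 1: inter 2 / union 3 ~= 0.667
miou = mean_iou(pred, target, num_classes=2)  # ~= 0.583
```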
