Learning to drive from a world on rails

We learn an interactive, vision-based driving policy from pre-recorded driving logs via a model-based approach. A forward model of the world, which predicts the outcome of any potential driving trajectory, supervises the driving policy. To support learning from pre-recorded logs, we assume the world is on rails: neither the agent nor its actions influence the environment. This assumption greatly simplifies the learning problem, factorizing the dynamics into a non-reactive world model and a low-dimensional, compact forward model of the ego-vehicle. Our approach computes action-values for each training trajectory using a tabular dynamic-programming evaluation of the Bellman equations; these action-values in turn supervise the final vision-based driving policy. Despite the world-on-rails assumption, the final driving policy acts well in a dynamic and reactive world. At the time of writing, our method ranks first on the CARLA leaderboard, attaining a 25% higher driving score while using 40 times less data. Our method is also an order of magnitude more sample-efficient than state-of-the-art model-free reinforcement learning techniques on navigational tasks in the ProcGen benchmark.
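
To make the tabular evaluation concrete, below is a minimal NumPy sketch of the backward Bellman backup over one recorded log. The discretization sizes (`N_STATES`, `N_ACTIONS`), the discount `GAMMA`, the array shapes, and the `bellman_backup` helper are illustrative assumptions rather than the paper's exact implementation; in the paper, the compact ego forward model supplies the transitions and the replayed (non-reactive) log supplies the rewards.

```python
import numpy as np

# Illustrative discretization sizes (assumptions): the paper discretizes the ego
# state (e.g. position, orientation, speed) and the action space (steer, throttle/brake).
N_STATES = 1000   # discretized ego states per timestep (assumption)
N_ACTIONS = 27    # discretized action combinations (assumption)
GAMMA = 0.9       # discount factor (assumption)

def bellman_backup(rewards, transitions):
    """Backward tabular evaluation of the Bellman equations along one recorded log.

    rewards:     (T, N_STATES, N_ACTIONS) array; reward for taking action a in ego
                 state s at time t, scored against the replayed, non-reactive world.
    transitions: (T, N_STATES, N_ACTIONS) integer array; index of the next ego state
                 given by the ego forward model (the rest of the world is "on rails").
    Returns Q:   (T, N_STATES, N_ACTIONS) action-values.
    """
    T = rewards.shape[0]
    Q = np.zeros_like(rewards)
    V_next = np.zeros(N_STATES)           # V_T(s) = 0 at the end of the log
    for t in range(T - 1, -1, -1):        # sweep backward in time
        # Q_t(s, a) = r_t(s, a) + gamma * V_{t+1}(T_ego(s, a))
        Q[t] = rewards[t] + GAMMA * V_next[transitions[t]]
        V_next = Q[t].max(axis=1)         # V_t(s) = max_a Q_t(s, a)
    return Q
```

As the abstract states, these action-values then serve as the supervision signal for training the final vision-based driving policy on the recorded camera observations.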

ICCV 2021
| Task | Dataset | Model | Metric Name | Metric Value | Global Rank |
|---|---|---|---|---|---|
| CARLA longest6 | CARLA | World on Rails (WOR) | Driving Score | 21 | #18 |
| | | | Route Completion | 48 | #18 |
| | | | Infraction Score | 0.56 | #12 |
| Autonomous Driving | CARLA Leaderboard | World on Rails | Driving Score | 31.37 | #13 |
| | | | Route Completion | 57.65 | #13 |
| | | | Infraction penalty | 0.56 | #13 |