Discrete and Continuous Action Representation for Practical RL in Video Games

23 Dec 2019  ·  Olivier Delalleau, Maxim Peter, Eloi Alonso, Adrien Logut

While most current research in Reinforcement Learning (RL) focuses on improving the performance of the algorithms in controlled environments, the use of RL under constraints like those met in the video game industry is rarely studied. Operating under such constraints, we propose Hybrid SAC, an extension of the Soft Actor-Critic algorithm able to handle discrete, continuous and parameterized actions in a principled way. We show that Hybrid SAC can successfully solve a high-speed driving task in one of our games, and is competitive with the state of the art on benchmark tasks with parameterized actions. We also explore the impact of using normalizing flows to enrich the expressiveness of the policy at minimal computational cost, and identify a potential undesired effect of SAC when used with normalizing flows, which may be addressed by optimizing a different objective.
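To make the "hybrid" action representation concrete, the sketch below shows one way a policy head can emit both a categorical distribution over discrete actions and a tanh-squashed Gaussian over continuous parameters, in the spirit of SAC. This is a minimal illustration, not the authors' code: the class and variable names, layer sizes, single shared trunk, and the choice to sample the continuous parameters independently of the sampled discrete action are all assumptions made for this example.

```python
# Minimal sketch of a hybrid (discrete + continuous) policy head in PyTorch.
# Hypothetical names and sizes; not the paper's exact architecture.
import torch
import torch.nn as nn
from torch.distributions import Categorical, Normal

class HybridPolicy(nn.Module):
    def __init__(self, state_dim: int, n_discrete: int, param_dim: int, hidden: int = 256):
        super().__init__()
        self.trunk = nn.Sequential(
            nn.Linear(state_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        self.logits = nn.Linear(hidden, n_discrete)    # discrete action distribution
        self.mu = nn.Linear(hidden, param_dim)         # mean of continuous parameters
        self.log_std = nn.Linear(hidden, param_dim)    # log std of continuous parameters

    def forward(self, state: torch.Tensor):
        h = self.trunk(state)
        disc_dist = Categorical(logits=self.logits(h))
        std = self.log_std(h).clamp(-20.0, 2.0).exp()
        cont_dist = Normal(self.mu(h), std)

        # Sample both components; the continuous part is squashed with tanh as in SAC.
        a_disc = disc_dist.sample()
        u = cont_dist.rsample()
        a_cont = torch.tanh(u)

        # Joint log-probability: discrete log-prob plus continuous log-prob
        # (with the usual tanh change-of-variables correction).
        log_prob = disc_dist.log_prob(a_disc) + (
            cont_dist.log_prob(u) - torch.log(1.0 - a_cont.pow(2) + 1e-6)
        ).sum(dim=-1)
        return a_disc, a_cont, log_prob
```

In a full SAC-style training loop, the critic would condition on the state together with both action components, and the joint log-probability above would feed the entropy term; how the discrete and continuous entropy contributions are weighted is a design choice not shown in this sketch.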


Results

| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Control with Parametrised Actions | Half Field Offence | Hybrid SAC | Goal Probability | 0.639 | #2 |
| Control with Parametrised Actions | Platform | Hybrid SAC | Return | 0.981 | #2 |
| Control with Parametrised Actions | Robot Soccer Goal | Hybrid SAC | Goal Probability | 0.728 | #2 |
