Browse State-of-the-Art
Datasets
Methods
More
Newsletter
RC2021
About
Trends
Portals
Libraries
Sign In
Subscribe to the PwC Newsletter
×
Stay informed on the latest trending ML papers with code, research developments, libraries, methods, and datasets.
Read previous issues
Join the community
×
You need to
log in
to edit.
You can
create a new account
if you don't have one.
Or, discuss a change on
Slack
.
Methods
> Reinforcement Learning
Reinforcement Learning
92 methods • 3614 papers with code
Policy Gradient Methods
PPO
177 papers with code
DDPG
129 papers with code
REINFORCE
120 papers with code
TRPO
48 papers with code
A2C
48 papers with code
See all 24 methods
Off-Policy TD Control
Q-Learning
980 papers with code
DQN
317 papers with code
Double Q-learning
81 papers with code
Clipped Double Q-learning
52 papers with code
Double DQN
29 papers with code
See all 16 methods
Q-Learning Networks
DQN
317 papers with code
Double DQN
29 papers with code
Dueling Network
18 papers with code
REM
18 papers with code
Rainbow DQN
6 papers with code
See all 10 methods
Distributed Reinforcement Learning
IMPALA
9 papers with code
Ape-X
8 papers with code
DD-PPO
4 papers with code
SEED RL
2 papers with code
APPO
1 papers with code
See all 7 methods
Reinforcement Learning Frameworks
SCST
10 papers with code
MushroomRL
1 papers with code
Blue River Controls
1 papers with code
myGym
1 papers with code
See all 7 methods
Value Function Estimation
HOC
127 papers with code
V-trace
21 papers with code
N-step Returns
19 papers with code
Retrace
18 papers with code
Stochastic Dueling Network
7 papers with code
See all 6 methods
On-Policy TD Control
Sarsa
31 papers with code
TD Lambda
8 papers with code
Expected Sarsa
6 papers with code
True Online TD Lambda
Sarsa Lambda
See all 5 methods
Board Game Models
AlphaZero
51 papers with code
MuZero
22 papers with code
TD-Gammon
2 papers with code
See all 4 methods
Eligibility Traces
Eligibility Trace
9 papers with code
Accumulating Eligibility Trace
9 papers with code
Dutch Eligibility Trace
Replacing Eligibility Trace
See all 4 methods
Behaviour Policies
Go-Explore
6 papers with code
Epsilon Greedy Exploration
3 papers with code
See all 4 methods
Heuristic Search Algorithms
GA
115 papers with code
Monte-Carlo Tree Search
89 papers with code
4D A*
1 papers with code
HFPSO
See all 4 methods
Replay Memory
Experience Replay
477 papers with code
Prioritized Experience Replay
73 papers with code
See all 2 methods
Efficient Planning
Prioritized Sweeping
5 papers with code
See all 2 methods
Randomized Value Functions
REM
18 papers with code
Noisy Linear Layer
8 papers with code
See all 2 methods
Video Game Models
CARLA
149 papers with code
AlphaStar
6 papers with code
See all 2 methods
RL Transformers
GTrXL
2 papers with code
CoBERL
1 papers with code
See all 2 methods
Motion Control
PPMC
1 papers with code
See all 2 methods
Imitation Learning Methods
Parrot
4 papers with code
PWIL
2 papers with code
See all 2 methods
Offline Reinforcement Learning Methods
Fisher-BRC
1 papers with code
IQL
1 papers with code
See all 2 methods
Exploration Strategies
gSDE
1 papers with code
See all 1 methods
Policy Evaluation
KOVA
1 papers with code
See all 1 methods
State Similarity Metrics
Policy Similarity Metric
1 papers with code
See all 1 methods
Bayesian Reinforcement Learning
Bayesian REX
1 papers with code
See all 1 methods
Path Planning
PPMC
1 papers with code
See all 1 methods
Actor-Critic Algorithms
FORK
1 papers with code
See all 1 methods
Environment Design Methods
Protagonist Antagonist Induced Regret Environment Design
1 papers with code
See all 1 methods
Density Ratio Learning
GradientDICE
1 papers with code
See all 1 methods
Card Game Models
DouZero
2 papers with code
See all 1 methods