| TASK | DATASET | MODEL | METRIC NAME | METRIC VALUE | GLOBAL RANK |
|---|---|---|---|---|---|
| Unsupervised Reinforcement Learning | URLB (pixels, 10^5 frames) | SMM | Walker (mean normalized return) | 6.07±6.14 | #10 |
| Unsupervised Reinforcement Learning | URLB (pixels, 10^5 frames) | SMM | Quadruped (mean normalized return) | 22.52±6.44 | #6 |
| Unsupervised Reinforcement Learning | URLB (pixels, 10^5 frames) | SMM | Jaco (mean normalized return) | 0.99±0.61 | #7 |
| Unsupervised Reinforcement Learning | URLB (pixels, 10^6 frames) | SMM | Walker (mean normalized return) | 6.61±6.70 | #9 |
| Unsupervised Reinforcement Learning | URLB (pixels, 10^6 frames) | SMM | Quadruped (mean normalized return) | 21.21±6.10 | #7 |
| Unsupervised Reinforcement Learning | URLB (pixels, 10^6 frames) | SMM | Jaco (mean normalized return) | 0.99±0.61 | #8 |
| Unsupervised Reinforcement Learning | URLB (pixels, 2*10^6 frames) | SMM | Walker (mean normalized return) | 6.61±6.70 | #10 |
| Unsupervised Reinforcement Learning | URLB (pixels, 2*10^6 frames) | SMM | Quadruped (mean normalized return) | 21.21±6.10 | #8 |
| Unsupervised Reinforcement Learning | URLB (pixels, 2*10^6 frames) | SMM | Jaco (mean normalized return) | 0.99±0.61 | #9 |
| Unsupervised Reinforcement Learning | URLB (pixels, 5*10^5 frames) | SMM | Walker (mean normalized return) | 6.31±6.44 | #9 |
| Unsupervised Reinforcement Learning | URLB (pixels, 5*10^5 frames) | SMM | Quadruped (mean normalized return) | 21.18±6.13 | #8 |
| Unsupervised Reinforcement Learning | URLB (pixels, 5*10^5 frames) | SMM | Jaco (mean normalized return) | 0.99±0.61 | #7 |
| Unsupervised Reinforcement Learning | URLB (states, 10^5 frames) | SMM | Walker (mean normalized return) | 57.84±26.88 | #9 |
| Unsupervised Reinforcement Learning | URLB (states, 10^5 frames) | SMM | Quadruped (mean normalized return) | 35.53±10.16 | #2 |
| Unsupervised Reinforcement Learning | URLB (states, 10^5 frames) | SMM | Jaco (mean normalized return) | 26.06±6.40 | #8 |
| Unsupervised Reinforcement Learning | URLB (states, 10^6 frames) | SMM | Walker (mean normalized return) | 72.60±32.07 | #8 |
| Unsupervised Reinforcement Learning | URLB (states, 10^6 frames) | SMM | Quadruped (mean normalized return) | 37.37±4.30 | #6 |
| Unsupervised Reinforcement Learning | URLB (states, 10^6 frames) | SMM | Jaco (mean normalized return) | 29.96±1.37 | #8 |
| Unsupervised Reinforcement Learning | URLB (states, 2*10^6 frames) | SMM | Walker (mean normalized return) | 77.13±29.55 | #2 |
| Unsupervised Reinforcement Learning | URLB (states, 2*10^6 frames) | SMM | Quadruped (mean normalized return) | 29.95±7.59 | #7 |
| Unsupervised Reinforcement Learning | URLB (states, 2*10^6 frames) | SMM | Jaco (mean normalized return) | 21.87±2.77 | #8 |
| Unsupervised Reinforcement Learning | URLB (states, 5*10^5 frames) | SMM | Walker (mean normalized return) | 73.64±33.56 | #8 |
| Unsupervised Reinforcement Learning | URLB (states, 5*10^5 frames) | SMM | Quadruped (mean normalized return) | 37.20±12.78 | #4 |
| Unsupervised Reinforcement Learning | URLB (states, 5*10^5 frames) | SMM | Jaco (mean normalized return) | 31.95±2.95 | #8 |