Randomized Value Functions

Random Ensemble Mixture

Introduced by Agarwal et al. in An Optimistic Perspective on Offline Reinforcement Learning

Random Ensemble Mixture (REM) is an easy to implement extension of DQN inspired by Dropout. The key intuition behind REM is that if one has access to multiple estimates of Q-values, then a weighted combination of the Q-value estimates is also an estimate for Q-values. Accordingly, in each training step, REM randomly combines multiple Q-value estimates and uses this random combination for robust training.

Source: An Optimistic Perspective on Offline Reinforcement Learning

Papers


Paper Code Results Date Stars

Tasks


Components


Component Type
DQN
Q-Learning Networks

Categories