2 code implementations • ICLR 2022 • Takuya Hiraoka, Takahisa Imagawa, Taisei Hashimoto, Takashi Onishi, Yoshimasa Tsuruoka
To make REDQ more computationally efficient, we propose a method of improving computational efficiency called DroQ, which is a variant of REDQ that uses a small ensemble of dropout Q-functions.
no code implementations • 7 May 2021 • Taisei Hashimoto, Yoshimasa Tsuruoka
The key idea of our method is making the transition between action-decision points usable as training data by considering pseudo-actions.