no code implementations • 18 Jun 2021 • Chenjun Xiao, Ilbin Lee, Bo Dai, Dale Schuurmans, Csaba Szepesvari
In high stake applications, active experimentation may be considered too risky and thus data are often collected passively.
reinforcement-learning Reinforcement Learning (RL)