ConQUR: Mitigating Delusional Bias in Deep Q-learning

ICLR 2020. Andy Su, Jayden Ooi, Tyler Lu, Dale Schuurmans, Craig Boutilier

Delusional bias is a fundamental source of error in approximate Q-learning. To date, the only techniques that explicitly address delusion require comprehensive search using tabular value estimates...



