Model-free reinforcement learning (RL) algorithms, such as Q-learning, directly parameterize and update value functions or policies without explicitly modeling the environment. They are typically simpler, more flexible to use, and thus more prevalent in modern deep RL than model-based approaches... (read more)
PDF Abstract NeurIPS 2018 PDF NeurIPS 2018 AbstractMETHOD | TYPE | |
---|---|---|
![]() |
Off-Policy TD Control |