no code implementations • 11 Oct 2022 • Jonathan C Balloch, Julia Kim, Jessica L Inman, Mark O Riedl
The exploration–exploitation trade-off in reinforcement learning (RL) is a well-known and much-studied problem of balancing greedy action selection against the pursuit of novel experience. Exploration methods are usually studied only in the context of learning the optimal policy for a single learning task.