19 Jun 2023 • Julien Perez, Denys Proux, Claude Roux, Michael Niemaz
To leverage reinforcement learning with text-based task descriptions, we need to produce reward functions associated with individual tasks in a scalable manner.