no code implementations • 16 Jul 2024 • Aske Plaat, Annie Wong, Suzan Verberne, Joost Broekens, Niki van Stein, Thomas Back
The field started with the question whether LLMs can solve grade school math word problems.
1 code implementation • 10 Feb 2024 • Annie Wong, Jacob de Nobel, Thomas Bäck, Aske Plaat, Anna V. Kononova
We benchmark both deep policy networks and networks consisting of a single linear layer from observations to actions for three gradient-based methods, such as Proximal Policy Optimization.
no code implementations • 29 Jun 2021 • Annie Wong, Thomas Bäck, Anna V. Kononova, Aske Plaat
This paper surveys the field of deep multiagent reinforcement learning.