no code implementations • 2 Feb 2023 • Peter Sunehag, Alexander Sasha Vezhnevets, Edgar Duéñez-Guzmán, Igor Mordatch, Joel Z. Leibo
The algorithm we propose consists of two parts: an agent architecture and a learning rule.
Q-Learning reinforcement-learning +1