no code implementations • 3 Apr 2024 • William Macke, Michael Doyle
Large Language Models (LLMs) have demonstrated impressive abilities in recent years with regards to code generation and understanding.
no code implementations • 16 Feb 2022 • Reuth Mirsky, Ignacio Carlucho, Arrasy Rahman, Elliot Fosong, William Macke, Mohan Sridharan, Peter Stone, Stefano V. Albrecht
Ad hoc teamwork is the research problem of designing agents that can collaborate with new teammates without prior coordination.
1 code implementation • 3 Dec 2021 • Yulin Zhang, William Macke, Jiaxun Cui, Daniel Urieli, Peter Stone
This article establishes for the first time that a multiagent driving policy can be trained in such a way that it generalizes to different traffic flows, AV penetration, and road geometries, including on multi-lane roads.
no code implementations • 1 Mar 2021 • William Macke, Reuth Mirsky, Peter Stone
We then present a novel planning algorithm for ad hoc teamwork, determining which query to ask and planning accordingly.
1 code implementation • 26 Feb 2021 • Jiaxun Cui, William Macke, Harel Yedidsion, Daniel Urieli, Peter Stone
Next, we propose a modular transfer reinforcement learning approach, and use it to scale up a multiagent driving policy to outperform human-like traffic and existing approaches in a simulated realistic scenario, which is an order of magnitude larger than past scenarios (hundreds instead of tens of vehicles).
no code implementations • 17 Feb 2020 • Garrett Bingham, William Macke, Risto Miikkulainen
The choice of activation function can have a large effect on the performance of a neural network.