Search Results for author: Nathaniel Grammel

Found 6 papers, 2 papers with code

PettingZoo: Gym for Multi-Agent Reinforcement Learning

2 code implementations • NeurIPS 2021 • J. K. Terry, Benjamin Black, Nathaniel Grammel, Mario Jayakumar, Ananth Hari, Ryan Sullivan, Luis Santos, Rodrigo Perez, Caroline Horsch, Clemens Dieffendahl, Niall L. Williams, Yashas Lokesh, Praveen Ravi

This paper introduces the PettingZoo library and the accompanying Agent Environment Cycle ("AEC") games model.

Multi-agent Reinforcement Learning reinforcement-learning +1

2,377

Paper
Code

Agent Environment Cycle Games

no code implementations • 28 Sep 2020 • Justin K. Terry, Nathaniel Grammel, Benjamin Black, Ananth Hari, Caroline Horsch, Luis Santos

Partially Observable Stochastic Games (POSGs) are the most general and common model of games used in Multi-Agent Reinforcement Learning (MARL).

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Stochastic Optimization and Learning for Two-Stage Supplier Problems

no code implementations • 7 Aug 2020 • Brian Brubach, Nathaniel Grammel, David G. Harris, Aravind Srinivasan, Leonidas Tsepenekas, Anil Vullikanti

The main focus of this paper is radius-based (supplier) clustering in the two-stage stochastic setting with recourse, where the inherent stochasticity of the model comes in the form of a budget constraint.

Data Structures and Algorithms

Paper
Add Code

Multi-Agent Informational Learning Processes

no code implementations • 11 Jun 2020 • Justin K. Terry, Nathaniel Grammel

We introduce a new mathematical model of multi-agent reinforcement learning, the Multi-Agent Informational Learning Processor "MAILP" model.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Add Code

Revisiting Parameter Sharing in Multi-Agent Deep Reinforcement Learning

2 code implementations • 27 May 2020 • J. K. Terry, Nathaniel Grammel, Sanghyun Son, Benjamin Black, Aakriti Agrawal

Next, we formally introduce methods to extend parameter sharing to learning in heterogeneous observation and action spaces, and prove that these methods allow for convergence to optimal policies.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Code

Scenario Submodular Cover

no code implementations • 10 Mar 2016 • Nathaniel Grammel, Lisa Hellerstein, Devorah Kletenik, Patrick Lin

In contrast, in Stochastic Submodular Cover, the variables of the input distribution are assumed to be independent, and the distribution of each variable is given as input.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.