Search Results for author: Naomi Ehrich Leonard

Found 20 papers, 3 papers with code

Unsupervised Learning of Lagrangian Dynamics from Images for Prediction and Control

1 code implementation • NeurIPS 2020 • Yaofeng Desmond Zhong, Naomi Ehrich Leonard

The VAE is designed to account for the geometry of physical systems composed of multiple rigid bodies in the plane.

Emergent Coordination through Game-Induced Nonlinear Opinion Dynamics

1 code implementation • 5 Apr 2023 • Haimin Hu, Kensuke Nakamura, Kai-Chieh Hsu, Naomi Ehrich Leonard, Jaime Fernández Fisac

We present a multi-agent decision-making framework for the emergent coordination of autonomous agents whose intents are initially undecided.

Decision Making

Satisficing in multi-armed bandit problems

no code implementations • 23 Dec 2015 • Paul Reverdy, Vaibhav Srivastava, Naomi Ehrich Leonard

Satisficing is a relaxation of maximizing and allows for less risky decision making in the face of uncertainty.

Decision Making

Distributed Cooperative Decision-Making in Multiarmed Bandits: Frequentist and Bayesian Algorithms

no code implementations • 2 Jun 2016 • Peter Landgren, Vaibhav Srivastava, Naomi Ehrich Leonard

We study distributed cooperative decision-making under the explore-exploit tradeoff in the multiarmed bandit (MAB) problem.

Decision Making

On Distributed Cooperative Decision-Making in Multiarmed Bandits

no code implementations • 21 Dec 2015 • Peter Landgren, Vaibhav Srivastava, Naomi Ehrich Leonard

We study the explore-exploit tradeoff in distributed cooperative decision-making using the context of the multiarmed bandit (MAB) problem.

Decision Making

Correlated Multiarmed Bandit Problem: Bayesian Algorithms and Regret Analysis

no code implementations • 5 Jul 2015 • Vaibhav Srivastava, Paul Reverdy, Naomi Ehrich Leonard

We consider the correlated multiarmed bandit (MAB) problem in which the rewards associated with each arm are modeled by a multivariate Gaussian random variable, and we investigate the influence of the assumptions in the Bayesian prior on the performance of the upper credible limit (UCL) algorithm and a new correlated UCL algorithm.

Decision Making

Cooperative learning in multi-agent systems from intermittent measurements

no code implementations • 11 Sep 2012 • Naomi Ehrich Leonard, Alex Olshevsky

Motivated by the problem of tracking a direction in a decentralized way, we consider the general problem of cooperative learning in multi-agent systems with time-varying connectivity and intermittent measurements.

Heterogeneous Stochastic Interactions for Multiple Agents in a Multi-armed Bandit Problem

no code implementations • 21 May 2019 • Udari Madhushani, Naomi Ehrich Leonard

We define and analyze a multi-agent multi-armed bandit problem in which decision-making agents can observe the choices and rewards of their neighbors.

Decision Making

Distributed Cooperative Decision Making in Multi-agent Multi-armed Bandits

no code implementations • 3 Mar 2020 • Peter Landgren, Vaibhav Srivastava, Naomi Ehrich Leonard

We consider a constrained reward model in which agents that choose the same arm at the same time receive no reward.

Decision Making, Multi-Armed Bandits

A Dynamic Observation Strategy for Multi-agent Multi-armed Bandit Problem

no code implementations • 8 Apr 2020 • Udari Madhushani, Naomi Ehrich Leonard

We define and analyze a multi-agent multi-armed bandit problem in which decision-making agents can observe the choices and rewards of their neighbors under a linear observation cost.

Decision Making

Distributed Learning: Sequential Decision Making in Resource-Constrained Environments

no code implementations • 13 Apr 2020 • Udari Madhushani, Naomi Ehrich Leonard

We study cost-effective communication strategies that can be used to improve the performance of distributed learning systems in resource-constrained environments.

Decision Making

LagNetViP: A Lagrangian Neural Network for Video Prediction

no code implementations • 24 Oct 2020 • Christine Allen-Blanchette, Sushant Veer, Anirudha Majumdar, Naomi Ehrich Leonard

In this paper, we introduce a video prediction model where the equations of motion are explicitly constructed from learned representations of the underlying physical quantities.

Acrobot, Video Prediction

On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension

no code implementations • 11 Nov 2020 • Udari Madhushani, Biswadip Dey, Naomi Ehrich Leonard, Amit Chakraborty

Value function based reinforcement learning (RL) algorithms, for example, $Q$-learning, learn optimal policies from datasets of actions, rewards, and state transitions.

Matrix Completion, Q-Learning, +2 more

Distributed Bandits: Probabilistic Communication on $d$-regular Graphs

no code implementations • 16 Nov 2020 • Udari Madhushani, Naomi Ehrich Leonard

Every edge in the graph has probabilistic weight $p$ to account for the $(1-p)$ probability of a communication link failure.

Provably Efficient Multi-Agent Reinforcement Learning with Fully Decentralized Communication

no code implementations • 14 Oct 2021 • Justin Lidard, Udari Madhushani, Naomi Ehrich Leonard

Distributed exploration reduces sampling complexity in multi-agent RL (MARL).

Multi-agent Reinforcement Learning, Q-Learning, +2 more

One More Step Towards Reality: Cooperative Bandits with Imperfect Communication

no code implementations • NeurIPS 2021 • Udari Madhushani, Abhimanyu Dubey, Naomi Ehrich Leonard, Alex Pentland

Most research on this problem focuses exclusively on the setting with perfect communication, whereas in real-world distributed settings communication is often over stochastic networks, with arbitrary corruptions and delays.

Decision Making

Learning with Delayed Payoffs in Population Games using Kullback-Leibler Divergence Regularization

no code implementations • 13 Jun 2023 • Shinkyu Park, Naomi Ehrich Leonard

The agents' goal is to learn the Nash equilibrium strategies of the game.

Decision Making

Learning to Predict 3D Rotational Dynamics from Images of a Rigid Body with Unknown Mass Distribution

2 code implementations • 24 Aug 2023 • Justice Mason, Christine Allen-Blanchette, Nicholas Zolman, Elizabeth Davison, Naomi Ehrich Leonard

In many real-world settings, image observations of freely rotating 3D rigid bodies may be available when low-dimensional measurements are not.

Active risk aversion in SIS epidemics on networks

no code implementations • 3 Nov 2023 • Anastasia Bizyaeva, Marcela Ordorica Arango, Yunxiu Zhou, Simon Levin, Naomi Ehrich Leonard

We prove that the model, with these two networks and populations using risk aversion strategies, exhibits a transcritical bifurcation in which an endemic equilibrium emerges.

Sparse dynamic network reconstruction through L1-regularization of a Lyapunov equation

no code implementations • 8 Mar 2024 • Ian Xul Belaustegui, Marcela Ordorica Arango, Román Rossi-Pool, Naomi Ehrich Leonard, Alessio Franci

An important problem in many areas of science is that of recovering interaction networks from simultaneous time-series of many interacting dynamical processes.

Time Series
