no code implementations • 1 Mar 2023 • Pedro Cisneros-Velarde, Sanmi Koyejo
Nash Q-learning may be considered one of the earliest and best-known algorithms in multi-agent reinforcement learning (MARL) for learning policies that constitute a Nash equilibrium of an underlying general-sum Markov game.
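A minimal sketch of the Nash-Q update for player 1 in a two-player game. The function names are hypothetical, and for tractability the stage-game equilibrium is computed only for the special case of a zero-sum stage game with a pure-strategy saddle point; the general-sum setting studied in the paper requires a full Nash solver at each state.

```python
import numpy as np

def saddle_point_value(Q1_s):
    # Value of a zero-sum stage game, *assuming* a pure-strategy
    # saddle point exists (maximin == minimax). A general-sum game
    # would need a proper stage-game Nash solver here instead.
    maximin = Q1_s.min(axis=1).max()
    minimax = Q1_s.max(axis=0).min()
    assert np.isclose(maximin, minimax), "no pure-strategy saddle point"
    return maximin

def nash_q_update(Q1, s, a1, a2, r1, s_next, alpha=0.1, gamma=0.9):
    # Q1[s] is the |A1| x |A2| payoff matrix of player 1 at state s.
    # Standard Nash-Q shape: bootstrap with the equilibrium value of
    # the next state's stage game rather than a max over own actions.
    nash_value = saddle_point_value(Q1[s_next])
    Q1[s, a1, a2] += alpha * (r1 + gamma * nash_value - Q1[s, a1, a2])
    return Q1
```

Replacing the max of single-agent Q-learning with an equilibrium value of the next-state stage game is the defining design choice of Nash Q-learning.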
no code implementations • 29 Sep 2022 • Arindam Banerjee, Pedro Cisneros-Velarde, Libin Zhu, Mikhail Belkin
Second, we introduce a new analysis of optimization based on Restricted Strong Convexity (RSC) which holds as long as the squared norm of the average gradient of predictors is $\Omega(\frac{\text{poly}(L)}{\sqrt{m}})$ for the square loss.
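The quantity in the stated condition, the squared norm of the sample-averaged gradient of the predictors, can be computed directly. A hedged numpy sketch for a one-hidden-layer network (the architecture, activation, and function names are illustrative assumptions, not the paper's setup):

```python
import numpy as np

def predictor_grad(W, v, x):
    # Gradient of the scalar predictor f(x) = v^T tanh(W x)
    # with respect to all parameters (W, v), flattened.
    h = np.tanh(W @ x)                # hidden activations
    dv = h                            # df/dv
    dW = np.outer(v * (1 - h**2), x)  # df/dW via the chain rule
    return np.concatenate([dW.ravel(), dv])

def avg_predictor_grad_sqnorm(W, v, X):
    # Squared Euclidean norm of the sample-averaged predictor gradient:
    # the quantity the RSC condition lower-bounds by Omega(poly(L)/sqrt(m)).
    grads = np.stack([predictor_grad(W, v, x) for x in X])
    return float(np.linalg.norm(grads.mean(axis=0))**2)
```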
1 code implementation • 7 Jun 2022 • Amnon Attali, Pedro Cisneros-Velarde, Marco Morales, Nancy M. Amato
While the difficulty of reinforcement learning problems is typically related to the complexity of their state spaces, abstraction proposes that solutions often lie in simpler underlying latent spaces.
no code implementations • 31 May 2022 • Pedro Cisneros-Velarde, Boxiang Lyu, Sanmi Koyejo, Mladen Kolar
Although parallelism has been extensively used in reinforcement learning (RL), the quantitative effects of parallel exploration are not well understood theoretically.
no code implementations • 18 May 2021 • Pedro Cisneros-Velarde, Francesco Bullo
Much recent interest has focused on the design of optimization algorithms from the discretization of an associated optimization flow, i.e., a system of ordinary differential equations (ODEs) whose trajectories solve an associated optimization problem.
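The simplest instance of this design pattern: forward-Euler discretization of the gradient flow x'(t) = -grad f(x(t)) recovers gradient descent. A minimal sketch (function names are illustrative, not from the paper):

```python
import numpy as np

def euler_discretize_gradient_flow(grad_f, x0, step, n_steps):
    # Forward-Euler discretization of the optimization flow
    #   x'(t) = -grad f(x(t)).
    # One Euler step with step size `step` is exactly one
    # gradient-descent iteration: x <- x - step * grad_f(x).
    x = np.asarray(x0, dtype=float)
    for _ in range(n_steps):
        x = x - step * grad_f(x)
    return x
```

For f(x) = ||x||^2 / 2 (so grad_f is the identity), the discretized flow contracts toward the minimizer at the origin.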
no code implementations • 15 Dec 2020 • Pedro Cisneros-Velarde, Francesco Bullo
Consider a multi-agent system whereby each agent has an initial probability measure.
no code implementations • 27 Mar 2020 • Pedro Cisneros-Velarde, Saber Jafarpour, Francesco Bullo
In this note, we provide an overarching analysis of the primal-dual dynamics associated with linear equality-constrained optimization problems using contraction analysis.
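For min f(x) subject to Ax = b, the standard primal-dual (saddle-point) dynamics are x' = -grad f(x) - A^T lam and lam' = Ax - b. A hedged forward-Euler sketch of one step (a textbook discretization, not the paper's contraction machinery):

```python
import numpy as np

def primal_dual_step(x, lam, grad_f, A, b, step):
    # One forward-Euler step of the primal-dual dynamics
    #   x'   = -grad f(x) - A^T lam   (primal descent)
    #   lam' =  A x - b               (dual ascent)
    # for the problem: min f(x) subject to A x = b.
    x_new = x - step * (grad_f(x) + A.T @ lam)
    lam_new = lam + step * (A @ x - b)
    return x_new, lam_new
```

For f(x) = ||x||^2 / 2 with the constraint x1 + x2 = 1, iterating this step converges to the optimizer x* = (0.5, 0.5) with multiplier lam* = -0.5, consistent with the stationarity condition grad f(x*) + A^T lam* = 0.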
1 code implementation • 22 May 2019 • Pedro Cisneros-Velarde, Sang-Yun Oh, Alexander Petersen
As a consequence of this formulation, the radius of the Wasserstein ambiguity set is directly related to the regularization parameter in the estimation problem.
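A well-known analogue of this radius-to-regularizer correspondence (illustrative only, not this paper's exact estimator): several Wasserstein-DRO formulations of linear regression reduce to a norm-regularized problem whose penalty weight equals the ambiguity radius. Sketched here with the radius playing the role of the l1 penalty in ISTA for the lasso:

```python
import numpy as np

def lasso_ista(X, y, radius, step=None, n_iter=2000):
    # Solve  min_b  (1/(2n)) ||y - X b||^2 + radius * ||b||_1
    # by ISTA (proximal gradient). The ambiguity-set radius acts
    # as the regularization parameter, mirroring the DRO-to-
    # regularization correspondence described above.
    n, d = X.shape
    if step is None:
        step = 1.0 / (np.linalg.norm(X, 2)**2 / n)  # 1/L of the smooth part
    b = np.zeros(d)
    for _ in range(n_iter):
        grad = X.T @ (X @ b - y) / n
        z = b - step * grad
        # Soft-thresholding: the proximal operator of the l1 penalty.
        b = np.sign(z) * np.maximum(np.abs(z) - step * radius, 0.0)
    return b
```

A small radius (weak ambiguity) yields a near-unregularized fit, while a large radius shrinks the estimate to zero, matching the intuition that a larger ambiguity set demands a more conservative estimator.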