Search Results for author: Pedro Cisneros-Velarde

Found 8 papers, 2 papers with code

Finite-sample Guarantees for Nash Q-learning with Linear Function Approximation

no code implementations1 Mar 2023 Pedro Cisneros-Velarde, Sanmi Koyejo

Nash Q-learning may be considered one of the first and most known algorithms in multi-agent reinforcement learning (MARL) for learning policies that constitute a Nash equilibrium of an underlying general-sum Markov game.

Multi-agent Reinforcement Learning Q-Learning

Restricted Strong Convexity of Deep Learning Models with Smooth Activations

no code implementations29 Sep 2022 Arindam Banerjee, Pedro Cisneros-Velarde, Libin Zhu, Mikhail Belkin

Second, we introduce a new analysis of optimization based on Restricted Strong Convexity (RSC) which holds as long as the squared norm of the average gradient of predictors is $\Omega(\frac{\text{poly}(L)}{\sqrt{m}})$ for the square loss.

Discrete State-Action Abstraction via the Successor Representation

1 code implementation7 Jun 2022 Amnon Attali, Pedro Cisneros-Velarde, Marco Morales, Nancy M. Amato

While the difficulty of reinforcement learning problems is typically related to the complexity of their state spaces, Abstraction proposes that solutions often lie in simpler underlying latent spaces.

Reinforcement Learning (RL) Transfer Learning

One Policy is Enough: Parallel Exploration with a Single Policy is Near-Optimal for Reward-Free Reinforcement Learning

no code implementations31 May 2022 Pedro Cisneros-Velarde, Boxiang Lyu, Sanmi Koyejo, Mladen Kolar

Although parallelism has been extensively used in reinforcement learning (RL), the quantitative effects of parallel exploration are not well understood theoretically.

Reinforcement Learning (RL)

A Contraction Theory Approach to Optimization Algorithms from Acceleration Flows

no code implementations18 May 2021 Pedro Cisneros-Velarde, Francesco Bullo

Much recent interest has focused on the design of optimization algorithms from the discretization of an associated optimization flow, i. e., a system of differential equations (ODEs) whose trajectories solve an associated optimization problem.

Distributed Wasserstein Barycenters via Displacement Interpolation

no code implementations15 Dec 2020 Pedro Cisneros-Velarde, Francesco Bullo

Consider a multi-agent system whereby each agent has an initial probability measure.

Sociology

Distributed and time-varying primal-dual dynamics via contraction analysis

no code implementations27 Mar 2020 Pedro Cisneros-Velarde, Saber Jafarpour, Francesco Bullo

In this note, we provide an overarching analysis of primal-dual dynamics associated to linear equality-constrained optimization problems using contraction analysis.

Distributed Optimization

Distributionally Robust Formulation and Model Selection for the Graphical Lasso

1 code implementation22 May 2019 Pedro Cisneros-Velarde, Sang-Yun Oh, Alexander Petersen

As a consequence of this formulation, the radius of the Wasserstein ambiguity set is directly related to the regularization parameter in the estimation problem.

Model Selection

Cannot find the paper you are looking for? You can Submit a new open access paper.