Search Results for author: Luca Viano

Found 8 papers, 4 papers with code

What can online reinforcement learning with function approximation benefit from general coverage conditions?

no code implementations • 25 Apr 2023 • Fanghui Liu, Luca Viano, Volkan Cevher

In online reinforcement learning (RL), instead of employing standard structural assumptions on Markov decision processes (MDPs), using a certain coverage condition (original from offline RL) is enough to ensure sample-efficient guarantees (Xie et al. 2023).

Offline RL Reinforcement Learning (RL)

Paper
Add Code

Identifiability and generalizability from multiple experts in Inverse Reinforcement Learning

1 code implementation • 22 Sep 2022 • Paul Rolland, Luca Viano, Norman Schuerhoff, Boris Nikolov, Volkan Cevher

While Reinforcement Learning (RL) aims to train an agent from a reward function in a given environment, Inverse Reinforcement Learning (IRL) seeks to recover the reward function from observing an expert's behavior.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Proximal Point Imitation Learning

2 code implementations • 22 Sep 2022 • Luca Viano, Angeliki Kamoutsi, Gergely Neu, Igor Krawczuk, Volkan Cevher

Thanks to PPM, we avoid nested policy evaluation and cost updates for online IL appearing in the prior literature.

Imitation Learning

Paper
Code

Understanding Deep Neural Function Approximation in Reinforcement Learning via $ε$-Greedy Exploration

no code implementations • 15 Sep 2022 • Fanghui Liu, Luca Viano, Volkan Cevher

To be specific, we focus on the value based algorithm with the $\epsilon$-greedy exploration via deep (and two-layer) neural networks endowed by Besov (and Barron) function spaces, respectively, which aims at approximating an $\alpha$-smooth Q-function in a $d$-dimensional feature space.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Neural NID Rules

no code implementations • 12 Feb 2022 • Luca Viano, Johanni Brea

Abstract object properties and their relations are deeply rooted in human common sense, allowing people to predict the dynamics of the world even in situations that are novel but governed by familiar laws of physics.

Common Sense Reasoning Model-based Reinforcement Learning +3

Paper
Add Code

Robust Learning from Observation with Model Misspecification

1 code implementation • 12 Feb 2022 • Luca Viano, Yu-Ting Huang, Parameswaran Kamalaruban, Craig Innes, Subramanian Ramamoorthy, Adrian Weller

Imitation learning (IL) is a popular paradigm for training policies in robotic systems when specifying the reward function is difficult.

Continuous Control Imitation Learning +1

Paper
Code

Sample-efficient actor-critic algorithms with an etiquette for zero-sum Markov games

no code implementations • 29 Sep 2021 • Ahmet Alacaoglu, Luca Viano, Niao He, Volkan Cevher

Our sample complexities also match the best-known results for global convergence of policy gradient and two time-scale actor-critic algorithms in the single agent setting.

Policy Gradient Methods

Paper
Add Code

Robust Inverse Reinforcement Learning under Transition Dynamics Mismatch

1 code implementation • NeurIPS 2021 • Luca Viano, Yu-Ting Huang, Parameswaran Kamalaruban, Adrian Weller, Volkan Cevher

We study the inverse reinforcement learning (IRL) problem under a transition dynamics mismatch between the expert and the learner.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.