Search Results for author: Luca Viano

Found 8 papers, 4 papers with code

What can online reinforcement learning with function approximation benefit from general coverage conditions?

no code implementations25 Apr 2023 Fanghui Liu, Luca Viano, Volkan Cevher

In online reinforcement learning (RL), instead of employing standard structural assumptions on Markov decision processes (MDPs), using a certain coverage condition (original from offline RL) is enough to ensure sample-efficient guarantees (Xie et al. 2023).

Offline RL Reinforcement Learning (RL)

Identifiability and generalizability from multiple experts in Inverse Reinforcement Learning

1 code implementation22 Sep 2022 Paul Rolland, Luca Viano, Norman Schuerhoff, Boris Nikolov, Volkan Cevher

While Reinforcement Learning (RL) aims to train an agent from a reward function in a given environment, Inverse Reinforcement Learning (IRL) seeks to recover the reward function from observing an expert's behavior.

reinforcement-learning Reinforcement Learning (RL)

Proximal Point Imitation Learning

2 code implementations22 Sep 2022 Luca Viano, Angeliki Kamoutsi, Gergely Neu, Igor Krawczuk, Volkan Cevher

Thanks to PPM, we avoid nested policy evaluation and cost updates for online IL appearing in the prior literature.

Imitation Learning

Understanding Deep Neural Function Approximation in Reinforcement Learning via $ε$-Greedy Exploration

no code implementations15 Sep 2022 Fanghui Liu, Luca Viano, Volkan Cevher

To be specific, we focus on the value based algorithm with the $\epsilon$-greedy exploration via deep (and two-layer) neural networks endowed by Besov (and Barron) function spaces, respectively, which aims at approximating an $\alpha$-smooth Q-function in a $d$-dimensional feature space.

reinforcement-learning Reinforcement Learning (RL)

Neural NID Rules

no code implementations12 Feb 2022 Luca Viano, Johanni Brea

Abstract object properties and their relations are deeply rooted in human common sense, allowing people to predict the dynamics of the world even in situations that are novel but governed by familiar laws of physics.

Common Sense Reasoning Model-based Reinforcement Learning +3

Robust Learning from Observation with Model Misspecification

1 code implementation12 Feb 2022 Luca Viano, Yu-Ting Huang, Parameswaran Kamalaruban, Craig Innes, Subramanian Ramamoorthy, Adrian Weller

Imitation learning (IL) is a popular paradigm for training policies in robotic systems when specifying the reward function is difficult.

Continuous Control Imitation Learning +1

Sample-efficient actor-critic algorithms with an etiquette for zero-sum Markov games

no code implementations29 Sep 2021 Ahmet Alacaoglu, Luca Viano, Niao He, Volkan Cevher

Our sample complexities also match the best-known results for global convergence of policy gradient and two time-scale actor-critic algorithms in the single agent setting.

Policy Gradient Methods

Cannot find the paper you are looking for? You can Submit a new open access paper.