no code implementations • 19 Mar 2024 • Edoardo Cetin, Andrea Tirinzoni, Matteo Pirotta, Alessandro Lazaric, Yann Ollivier, Ahmed Touati
Offline reinforcement learning algorithms have proven effective on datasets highly connected to the target downstream task.
no code implementations • ICCV 2023 • Edoardo Cetin, Antonio Carta, Oya Celiktutan
Meta-learning holds the potential to provide a general and explicit solution to tackle interference and forgetting in continual learning.
no code implementations • 13 Oct 2022 • Edoardo Cetin, Oya Celiktutan
We model agent behavior as the steady-state distribution of a parameterized reasoning Markov chain (RMC), optimized with a new tractable estimate of the policy gradient.
no code implementations • 4 Oct 2022 • Edoardo Cetin, Benjamin Chamberlain, Michael Bronstein, Jonathan J Hunt
We propose a new class of deep reinforcement learning (RL) algorithms that model latent representations in hyperbolic space.
1 code implementation • 3 Jul 2022 • Edoardo Cetin, Philip J. Ball, Steve Roberts, Oya Celiktutan
Off-policy reinforcement learning (RL) from pixel observations is notoriously unstable.
no code implementations • 7 Oct 2021 • Edoardo Cetin, Oya Celiktutan
Off-policy deep reinforcement learning algorithms commonly compensate for overestimation bias during temporal-difference learning by utilizing pessimistic estimates of the expected target returns.
no code implementations • 5 Jun 2021 • Edoardo Cetin, Oya Celiktutan
Within our framework, agents learn effective behavior over a routine space: a new, higher-level action space, where each routine represents a set of 'equivalent' sequences of granular actions with arbitrary length.
no code implementations • 21 Apr 2021 • Jian Jiang, Edoardo Cetin, Oya Celiktutan
However, finding a trade-off between the model performance and the number of samples to save for each class is still an open problem for replay-based incremental learning and is increasingly desirable for real-life applications.
1 code implementation • ICLR 2021 • Edoardo Cetin, Oya Celiktutan
Human beings are able to understand objectives and learn by simply observing others perform a task.