2 code implementations • 25 Jun 2024 • Sebastian Dittert, Vincent Moens, Gianni de Fabritiis
The integration of TorchRL with the LEGO hubs, via Bluetooth bidirectional communication, enables state-of-the-art reinforcement learning training on GPUs for a wide variety of LEGO builds.
2 code implementations • 7 May 2024 • Albert Bou, Morgan Thomas, Sebastian Dittert, Carles Navarro Ramírez, Maciej Majewski, Ye Wang, Shivam Patel, Gary Tresadern, Mazen Ahmad, Vincent Moens, Woody Sherman, Simone Sciabola, Gianni de Fabritiis
In recent years, reinforcement learning (RL) has emerged as a valuable tool in drug design, offering the potential to propose and optimize molecules with desired properties.
1 code implementation • 3 Dec 2023 • Matteo Bettini, Amanda Prorok, Vincent Moens
The field of Multi-Agent Reinforcement Learning (MARL) is currently facing a reproducibility crisis.
2 code implementations • 1 Jun 2023 • Albert Bou, Matteo Bettini, Sebastian Dittert, Vikash Kumar, Shagun Sodhani, Xiaomeng Yang, Gianni de Fabritiis, Vincent Moens
PyTorch has ascended as a premier machine learning framework, yet it lacks a native and comprehensive library for decision and control tasks suitable for large development teams dealing with complex real-world data and environments.
1 code implementation • 12 Dec 2022 • Zhao Mandi, Homanga Bharadhwaj, Vincent Moens, Shuran Song, Aravind Rajeswaran, Vikash Kumar
On a real robot setup, CACTI enables efficient training of a single policy that can perform 10 manipulation tasks involving kitchen objects, and is robust to varying layouts of distractors.
no code implementations • 6 Jul 2021 • Vincent Moens, Aivar Sootla, Haitham Bou Ammar, Jun Wang
We present a method for conditional sampling for pre-trained normalizing flows when only part of an observation is available.
no code implementations • 15 Jan 2021 • Vincent Moens, Hang Ren, Alexandre Maraval, Rasul Tutunov, Jun Wang, Haitham Ammar
In this paper, we propose CI-VI an efficient and scalable solver for semi-implicit variational inference (SIVI).
1 code implementation • 12 Jun 2020 • Alexander I. Cowen-Rivers, Daniel Palenicek, Vincent Moens, Mohammed Abdullah, Aivar Sootla, Jun Wang, Haitham Ammar
In this paper, we propose SAMBA, a novel framework for safe reinforcement learning that combines aspects from probabilistic modelling, information theory, and statistics.
no code implementations • 27 Jan 2020 • Vincent Moens, Simiao Yu, Gholamreza Salimi-Khorshidi
This paper shows that most of the existing convolutional architectures define, at initialisation, a specific feature importance landscape that conditions their capacity to attend to different locations of the images later during training or even at test time.
no code implementations • ICML 2018 • Vincent Moens
A common problem in Machine Learning and statistics consists in detecting whether the current sample in a stream of data belongs to the same distribution as previous ones, is an isolated outlier or inaugurates a new distribution of data.