no code implementations • 12 Dec 2023 • Manon Flageat, Bryan Lim, Antoine Cully
We highlight that existing procedures that only use the expected return are limited on two fronts: first an infinite number of return distributions with a wide range of performance-reproducibility trade-offs can have the same expected return, limiting its effectiveness when used for comparing policies; second, the expected return metric does not leave any room for practitioners to choose the best trade-off value for considered applications.
no code implementations • 3 Nov 2023 • Garðar Ingvarsson, Mikayel Samvelyan, Bryan Lim, Manon Flageat, Antoine Cully, Tim Rocktäschel
In many real-world systems, such as adaptive robotics, achieving a single, optimised solution may be insufficient.
1 code implementation • 7 Aug 2023 • Felix Chalumeau, Bryan Lim, Raphael Boige, Maxime Allard, Luca Grillotti, Manon Flageat, Valentin Macé, Arthur Flajolet, Thomas Pierrot, Antoine Cully
QDax is an open-source library with a streamlined and modular API for Quality-Diversity (QD) optimization algorithms in Jax.
no code implementations • 24 Apr 2023 • Simón C. Smith, Bryan Lim, Hannah Janmohamed, Antoine Cully
This method uses a dynamics model, learned from interactions between the robot and the environment, to predict the robot's behaviour and improve sample efficiency.
no code implementations • 7 Apr 2023 • Luca Grillotti, Manon Flageat, Bryan Lim, Antoine Cully
Quality-Diversity (QD) algorithms are designed to generate collections of high-performing solutions while maximizing their diversity in a given descriptor space.
no code implementations • 10 Mar 2023 • Manon Flageat, Bryan Lim, Antoine Cully
With the development of fast and massively parallel evaluations in many domains, Quality-Diversity (QD) algorithms, that already proved promising in a large range of applications, have seen their potential multiplied.
no code implementations • 10 Mar 2023 • Bryan Lim, Manon Flageat, Antoine Cully
However, we also find that not all insights from Deep RL can be effectively translated to QD-RL.
no code implementations • 22 Nov 2022 • Bryan Lim, Manon Flageat, Antoine Cully
Methods such as Quality-Diversity deals with this by encouraging novel solutions and producing a diversity of behaviours.
1 code implementation • 4 Nov 2022 • Manon Flageat, Bryan Lim, Luca Grillotti, Maxime Allard, Simón C. Smith, Antoine Cully
We present a Quality-Diversity benchmark suite for Deep Neuroevolution in Reinforcement Learning domains for robot control.
no code implementations • 18 Oct 2022 • Maxime Allard, Simón C. Smith, Konstantinos Chatzilygeroudis, Bryan Lim, Antoine Cully
Quality-Diversity (QD) algorithms have been successfully used to make robots adapt to damages in seconds by leveraging a diverse set of learned skills.
no code implementations • 10 Oct 2022 • Shikha Surana, Bryan Lim, Antoine Cully
Data-driven learning based methods have recently been particularly successful at learning robust locomotion controllers for a variety of unstructured terrains.
1 code implementation • 6 Oct 2022 • Felix Chalumeau, Raphael Boige, Bryan Lim, Valentin Macé, Maxime Allard, Arthur Flajolet, Antoine Cully, Thomas Pierrot
Recent work has shown that training a mixture of policies, as opposed to a single one, that are driven to explore different regions of the state-action space can address this shortcoming by generating a diverse set of behaviors, referred to as skills, that can be collectively used to great effect in adaptation tasks or for hierarchical planning.
no code implementations • 7 Apr 2022 • Bryan Lim, Alexander Reichenbach, Antoine Cully
Quality-Diversity (QD) algorithms can discover large and complex behavioural repertoires consisting of both diverse and high-performing skills.
2 code implementations • 2 Feb 2022 • Bryan Lim, Maxime Allard, Luca Grillotti, Antoine Cully
With recent advances in simulators that run on accelerators, thousands of evaluations can now be performed in parallel on single GPU/TPU.
no code implementations • 16 Sep 2021 • Bryan Lim, Luca Grillotti, Lorenzo Bernasconi, Antoine Cully
In this paper, we propose Dynamics-Aware Quality-Diversity (DA-QD), a framework to improve the sample efficiency of QD algorithms through the use of dynamics models.
1 code implementation • 20 May 2021 • Daniel Poh, Bryan Lim, Stefan Zohren, Stephen Roberts
The performance of a cross-sectional currency strategy depends crucially on accurately ranking instruments prior to portfolio construction.
no code implementations • 17 Feb 2021 • Zihao Zhang, Bryan Lim, Stefan Zohren
Market by order (MBO) data - a detailed feed of individual trade instructions for a given stock on an exchange - is arguably one of the most granular sources of microstructure information.
no code implementations • 13 Dec 2020 • Daniel Poh, Bryan Lim, Stefan Zohren, Stephen Roberts
The success of a cross-sectional systematic strategy depends critically on accurately ranking assets prior to portfolio construction.
no code implementations • 9 Dec 2020 • Maria Bauza, Eric Valls, Bryan Lim, Theo Sechopoulos, Alberto Rodriguez
In this paper, we present an approach to tactile pose estimation from the first touch for known objects.
no code implementations • 28 Apr 2020 • Bryan Lim, Stefan Zohren
Numerous deep learning architectures have been developed to accommodate the diversity of time series datasets across different domains.
no code implementations • 23 Jan 2020 • Bryan Lim, Stefan Zohren, Stephen Roberts
Detecting changes in asset co-movements is of much importance to financial practitioners, with numerous risk management benefits arising from the timely detection of breakdowns in historical correlations.
34 code implementations • 19 Dec 2019 • Bryan Lim, Sercan O. Arik, Nicolas Loeff, Tomas Pfister
Multi-horizon forecasting problems often contain a complex mix of inputs -- including static (i. e. time-invariant) covariates, known future inputs, and other exogenous time series that are only observed historically -- without any prior information on how they interact with the target.
no code implementations • 23 May 2019 • Bryan Lim, Stefan Zohren, Stephen Roberts
Despite recent innovations in network architectures and loss functions, training RNNs to learn long-term dependencies remains difficult due to challenges with gradient-based optimisation methods.
1 code implementation • 9 Apr 2019 • Bryan Lim, Stefan Zohren, Stephen Roberts
While time series momentum is a well-studied phenomenon in finance, common strategies require the explicit definition of both a trend estimator and a position sizing rule.
1 code implementation • 23 Jan 2019 • Bryan Lim, Stefan Zohren, Stephen Roberts
Testing this on three real-world time series datasets, we demonstrate that the decoupled representations learnt not only improve the accuracy of one-step-ahead forecasts while providing realistic uncertainty estimates, but also facilitate multistep prediction through the separation of encoder stages.
1 code implementation • NeurIPS 2018 • Bryan Lim
Electronic health records provide a rich source of data for machine learning methods to learn dynamic treatment responses over time.
no code implementations • 6 Jul 2018 • Bryan Lim, Mihaela van der Schaar
Joint models for longitudinal and time-to-event data are commonly used in longitudinal studies to forecast disease trajectories over time.
no code implementations • 27 Mar 2018 • Bryan Lim, Mihaela van der Schaar
Joint models for longitudinal and time-to-event data are commonly used in longitudinal studies to forecast disease trajectories over time.