Search Results for author: Bryan Lim

Found 28 papers, 9 papers with code

Beyond Expected Return: Accounting for Policy Reproducibility when Evaluating Reinforcement Learning Algorithms

no code implementations • 12 Dec 2023 • Manon Flageat, Bryan Lim, Antoine Cully

We highlight that existing procedures that only use the expected return are limited on two fronts: first an infinite number of return distributions with a wide range of performance-reproducibility trade-offs can have the same expected return, limiting its effectiveness when used for comparing policies; second, the expected return metric does not leave any room for practitioners to choose the best trade-off value for considered applications.

Bayesian Optimisation Reinforcement Learning (RL)

Paper
Add Code

Mix-ME: Quality-Diversity for Multi-Agent Learning

no code implementations • 3 Nov 2023 • Garðar Ingvarsson, Mikayel Samvelyan, Bryan Lim, Manon Flageat, Antoine Cully, Tim Rocktäschel

In many real-world systems, such as adaptive robotics, achieving a single, optimised solution may be insufficient.

Continuous Control

Paper
Add Code

QDax: A Library for Quality-Diversity and Population-based Algorithms with Hardware Acceleration

1 code implementation • 7 Aug 2023 • Felix Chalumeau, Bryan Lim, Raphael Boige, Maxime Allard, Luca Grillotti, Manon Flageat, Valentin Macé, Arthur Flajolet, Thomas Pierrot, Antoine Cully

QDax is an open-source library with a streamlined and modular API for Quality-Diversity (QD) optimization algorithms in Jax.

Continuous Control reinforcement-learning +1

243

Paper
Code

Quality-Diversity Optimisation on a Physical Robot Through Dynamics-Aware and Reset-Free Learning

no code implementations • 24 Apr 2023 • Simón C. Smith, Bryan Lim, Hannah Janmohamed, Antoine Cully

This method uses a dynamics model, learned from interactions between the robot and the environment, to predict the robot's behaviour and improve sample efficiency.

Paper
Add Code

Don't Bet on Luck Alone: Enhancing Behavioral Reproducibility of Quality-Diversity Solutions in Uncertain Domains

no code implementations • 7 Apr 2023 • Luca Grillotti, Manon Flageat, Bryan Lim, Antoine Cully

Quality-Diversity (QD) algorithms are designed to generate collections of high-performing solutions while maximizing their diversity in a given descriptor space.

Paper
Add Code

Enhancing MAP-Elites with Multiple Parallel Evolution Strategies

no code implementations • 10 Mar 2023 • Manon Flageat, Bryan Lim, Antoine Cully

With the development of fast and massively parallel evaluations in many domains, Quality-Diversity (QD) algorithms, that already proved promising in a large range of applications, have seen their potential multiplied.

Paper
Add Code

Understanding the Synergies between Quality-Diversity and Deep Reinforcement Learning

no code implementations • 10 Mar 2023 • Bryan Lim, Manon Flageat, Antoine Cully

However, we also find that not all insights from Deep RL can be effectively translated to QD-RL.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Efficient Exploration using Model-Based Quality-Diversity with Gradients

no code implementations • 22 Nov 2022 • Bryan Lim, Manon Flageat, Antoine Cully

Methods such as Quality-Diversity deals with this by encouraging novel solutions and producing a diversity of behaviours.

Efficient Exploration

Paper
Add Code

Benchmarking Quality-Diversity Algorithms on Neuroevolution for Reinforcement Learning

1 code implementation • 4 Nov 2022 • Manon Flageat, Bryan Lim, Luca Grillotti, Maxime Allard, Simón C. Smith, Antoine Cully

We present a Quality-Diversity benchmark suite for Deep Neuroevolution in Reinforcement Learning domains for robot control.

Benchmarking reinforcement-learning +1

243

Paper
Code

Online Damage Recovery for Physical Robots with Hierarchical Quality-Diversity

no code implementations • 18 Oct 2022 • Maxime Allard, Simón C. Smith, Konstantinos Chatzilygeroudis, Bryan Lim, Antoine Cully

Quality-Diversity (QD) algorithms have been successfully used to make robots adapt to damages in seconds by leveraging a diverse set of learned skills.

Paper
Add Code

Efficient Learning of Locomotion Skills through the Discovery of Diverse Environmental Trajectory Generator Priors

no code implementations • 10 Oct 2022 • Shikha Surana, Bryan Lim, Antoine Cully

Data-driven learning based methods have recently been particularly successful at learning robust locomotion controllers for a variety of unstructured terrains.

Paper
Add Code

Neuroevolution is a Competitive Alternative to Reinforcement Learning for Skill Discovery

1 code implementation • 6 Oct 2022 • Felix Chalumeau, Raphael Boige, Bryan Lim, Valentin Macé, Maxime Allard, Arthur Flajolet, Antoine Cully, Thomas Pierrot

Recent work has shown that training a mixture of policies, as opposed to a single one, that are driven to explore different regions of the state-action space can address this shortcoming by generating a diverse set of behaviors, referred to as skills, that can be collectively used to great effect in adaptation tasks or for hierarchical planning.

reinforcement-learning Reinforcement Learning (RL)

Paper
Code

Learning to Walk Autonomously via Reset-Free Quality-Diversity

no code implementations • 7 Apr 2022 • Bryan Lim, Alexander Reichenbach, Antoine Cully

Quality-Diversity (QD) algorithms can discover large and complex behavioural repertoires consisting of both diverse and high-performing skills.

Paper
Add Code

Accelerated Quality-Diversity through Massive Parallelism

2 code implementations • 2 Feb 2022 • Bryan Lim, Maxime Allard, Luca Grillotti, Antoine Cully

With recent advances in simulators that run on accelerators, thousands of evaluations can now be performed in parallel on single GPU/TPU.

783

Paper
Code

Dynamics-Aware Quality-Diversity for Efficient Learning of Skill Repertoires

no code implementations • 16 Sep 2021 • Bryan Lim, Luca Grillotti, Lorenzo Bernasconi, Antoine Cully

In this paper, we propose Dynamics-Aware Quality-Diversity (DA-QD), a framework to improve the sample efficiency of QD algorithms through the use of dynamics models.

Zero-Shot Learning

Paper
Add Code

Enhancing Cross-Sectional Currency Strategies by Context-Aware Learning to Rank with Self-Attention

1 code implementation • 20 May 2021 • Daniel Poh, Bryan Lim, Stefan Zohren, Stephen Roberts

The performance of a cross-sectional currency strategy depends crucially on accurately ranking instruments prior to portfolio construction.

Information Retrieval Learning-To-Rank +2

800

Paper
Code

Deep Learning for Market by Order Data

no code implementations • 17 Feb 2021 • Zihao Zhang, Bryan Lim, Stefan Zohren

Market by order (MBO) data - a detailed feed of individual trade instructions for a given stock on an exchange - is arguably one of the most granular sources of microstructure information.

Paper
Add Code

Building Cross-Sectional Systematic Strategies By Learning to Rank

no code implementations • 13 Dec 2020 • Daniel Poh, Bryan Lim, Stefan Zohren, Stephen Roberts

The success of a cross-sectional systematic strategy depends critically on accurately ranking assets prior to portfolio construction.

Information Retrieval Learning-To-Rank +1

Paper
Add Code

Tactile Object Pose Estimation from the First Touch with Geometric Contact Rendering

no code implementations • 9 Dec 2020 • Maria Bauza, Eric Valls, Bryan Lim, Theo Sechopoulos, Alberto Rodriguez

In this paper, we present an approach to tactile pose estimation from the first touch for known objects.

Contrastive Learning Object +1

Paper
Add Code

Time Series Forecasting With Deep Learning: A Survey

no code implementations • 28 Apr 2020 • Bryan Lim, Stefan Zohren

Numerous deep learning architectures have been developed to accommodate the diversity of time series datasets across different domains.

Time Series Time Series Forecasting

Paper
Add Code

Detecting Changes in Asset Co-Movement Using the Autoencoder Reconstruction Ratio

no code implementations • 23 Jan 2020 • Bryan Lim, Stefan Zohren, Stephen Roberts

Detecting changes in asset co-movements is of much importance to financial practitioners, with numerous risk management benefits arising from the timely detection of breakdowns in historical correlations.

Denoising Dimensionality Reduction +1

Paper
Add Code

Temporal Fusion Transformers for Interpretable Multi-horizon Time Series Forecasting

34 code implementations • 19 Dec 2019 • Bryan Lim, Sercan O. Arik, Nicolas Loeff, Tomas Pfister

Multi-horizon forecasting problems often contain a complex mix of inputs -- including static (i. e. time-invariant) covariates, known future inputs, and other exogenous time series that are only observed historically -- without any prior information on how they interact with the target.

Interpretable Machine Learning Time Series +1

32,796

Paper
Code

Population-based Global Optimisation Methods for Learning Long-term Dependencies with RNNs

no code implementations • 23 May 2019 • Bryan Lim, Stefan Zohren, Stephen Roberts

Despite recent innovations in network architectures and loss functions, training RNNs to learn long-term dependencies remains difficult due to challenges with gradient-based optimisation methods.

reinforcement-learning Reinforcement Learning (RL) +2

Paper
Add Code

Enhancing Time Series Momentum Strategies Using Deep Neural Networks

1 code implementation • 9 Apr 2019 • Bryan Lim, Stefan Zohren, Stephen Roberts

While time series momentum is a well-studied phenomenon in finance, common strategies require the explicit definition of both a trend estimator and a position sizing rule.

Position Time Series +1

Paper
Code

Recurrent Neural Filters: Learning Independent Bayesian Filtering Steps for Time Series Prediction

1 code implementation • 23 Jan 2019 • Bryan Lim, Stefan Zohren, Stephen Roberts

Testing this on three real-world time series datasets, we demonstrate that the decoupled representations learnt not only improve the accuracy of one-step-ahead forecasts while providing realistic uncertainty estimates, but also facilitate multistep prediction through the separation of encoder stages.

Time Series Time Series Prediction

Paper
Code

Forecasting Treatment Responses Over Time Using Recurrent Marginal Structural Networks

1 code implementation • NeurIPS 2018 • Bryan Lim

Electronic health records provide a rich source of data for machine learning methods to learn dynamic treatment responses over time.

Epidemiology

Paper
Code

Forecasting Disease Trajectories in Alzheimer's Disease Using Deep Learning

no code implementations • 6 Jul 2018 • Bryan Lim, Mihaela van der Schaar

Joint models for longitudinal and time-to-event data are commonly used in longitudinal studies to forecast disease trajectories over time.

Paper
Add Code

Disease-Atlas: Navigating Disease Trajectories with Deep Learning

no code implementations • 27 Mar 2018 • Bryan Lim, Mihaela van der Schaar

Joint models for longitudinal and time-to-event data are commonly used in longitudinal studies to forecast disease trajectories over time.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.