Search Results for author: Lukasz Szpruch

Found 19 papers, 5 papers with code

A Fisher-Rao gradient flow for entropy-regularised Markov decision processes in Polish spaces

no code implementations4 Oct 2023 Bekzhan Kerimkulov, James-Michael Leahy, David Siska, Lukasz Szpruch, Yufei Zhang

We study the global convergence of a Fisher-Rao policy gradient flow for infinite-horizon entropy-regularised Markov decision processes with Polish state and action space.

LEMMA

Optimal scheduling of entropy regulariser for continuous-time linear-quadratic reinforcement learning

no code implementations8 Aug 2022 Lukasz Szpruch, Tanut Treetanthiploet, Yufei Zhang

This work uses the entropy-regularised relaxed stochastic control perspective as a principled framework for designing reinforcement learning (RL) algorithms.

reinforcement-learning Reinforcement Learning (RL) +1

Synthetic Data -- what, why and how?

no code implementations6 May 2022 James Jordon, Lukasz Szpruch, Florimond Houssiau, Mirko Bottarelli, Giovanni Cherubin, Carsten Maple, Samuel N. Cohen, Adrian Weller

This explainer document aims to provide an overview of the current state of the rapidly expanding work on synthetic data technologies, with a particular focus on privacy.

Sig-Wasserstein GANs for Time Series Generation

1 code implementation1 Nov 2021 Hao Ni, Lukasz Szpruch, Marc Sabate-Vidales, Baoren Xiao, Magnus Wiese, Shujian Liao

Synthetic data is an emerging technology that can significantly accelerate the development and deployment of AI machine learning pipelines.

Time Series Time Series Analysis +1

Identifiability in inverse reinforcement learning

no code implementations NeurIPS 2021 Haoyang Cao, Samuel N. Cohen, Lukasz Szpruch

Inverse reinforcement learning attempts to reconstruct the reward function in a Markov decision problem, using observations of agent actions.

reinforcement-learning Reinforcement Learning (RL)

Black-box model risk in finance

no code implementations9 Feb 2021 Samuel N. Cohen, Derek Snow, Lukasz Szpruch

Machine learning models are increasingly used in a wide variety of financial settings.

BIG-bench Machine Learning Management

Robust pricing and hedging via neural SDEs

1 code implementation8 Jul 2020 Patryk Gierjatowicz, Marc Sabate-Vidales, David Šiška, Lukasz Szpruch, Žan Žurič

Combining neural networks with risk models based on classical stochastic differential equations (SDEs), we find robust bounds for prices of derivatives and the corresponding hedging strategies while incorporating relevant market data.

Model Selection

Conditional Sig-Wasserstein GANs for Time Series Generation

2 code implementations9 Jun 2020 Shujian Liao, Hao Ni, Lukasz Szpruch, Magnus Wiese, Marc Sabate-Vidales, Baoren Xiao

The signature of a path is a graded sequence of statistics that provides a universal description for a stream of data, and its expected value characterises the law of the time-series model.

Time Series Time Series Analysis +1

Sig-SDEs model for quantitative finance

no code implementations30 May 2020 Imanol Perez Arribas, Cristopher Salvi, Lukasz Szpruch

Mathematical models, calibrated to data, have become ubiquitous to make key decision processes in modern quantitative finance.

Model Selection Time Series +1

Mean-Field Langevin Dynamics and Energy Landscape of Neural Networks

no code implementations19 May 2019 Kaitong Hu, Zhenjie Ren, David Siska, Lukasz Szpruch

Our work is motivated by a desire to study the theoretical underpinning for the convergence of stochastic gradient type algorithms widely used for non-convex learning tasks such as training of neural networks.

Unbiased deep solvers for linear parametric PDEs

2 code implementations11 Oct 2018 Marc Sabate Vidales, David Siska, Lukasz Szpruch

We develop several deep learning algorithms for approximating families of parametric PDE solutions.

Non-asymptotic bounds for sampling algorithms without log-concavity

no code implementations21 Aug 2018 Mateusz B. Majka, Aleksandar Mijatović, Lukasz Szpruch

Finally, we provide a weak convergence analysis that covers both the standard and the randomised (inaccurate) drift case.

Multilevel Monte Carlo for Scalable Bayesian Computations

no code implementations15 Sep 2016 Mike Giles, Tigran Nagapetyan, Lukasz Szpruch, Sebastian Vollmer, Konstantinos Zygalakis

In contrast to MCMC methods, Stochastic Gradient MCMC (SGMCMC) algorithms such as the Stochastic Gradient Langevin Dynamics (SGLD) only require access to a batch of the data set at every step.

Multilevel Monte Carlo methods for the approximation of invariant measures of stochastic differential equations

no code implementations4 May 2016 Michael B. Giles, Mateusz B. Majka, Lukasz Szpruch, Sebastian Vollmer, Konstantinos Zygalakis

We show that this is the first stochastic gradient MCMC method with complexity $\mathcal{O}(\varepsilon^{-2}|\log {\varepsilon}|^{3})$, in contrast to the complexity $\mathcal{O}(\varepsilon^{-3})$ of currently available methods.

Cannot find the paper you are looking for? You can Submit a new open access paper.