Search Results for author: Lukasz Szpruch

Found 19 papers, 5 papers with code

A Fisher-Rao gradient flow for entropy-regularised Markov decision processes in Polish spaces

no code implementations • 4 Oct 2023 • Bekzhan Kerimkulov, James-Michael Leahy, David Šiška, Lukasz Szpruch, Yufei Zhang

We study the global convergence of a Fisher-Rao policy gradient flow for infinite-horizon entropy-regularised Markov decision processes with Polish state and action space.

LEMMA

The AI Revolution: Opportunities and Challenges for the Finance Sector

no code implementations • 31 Aug 2023 • Carsten Maple, Lukasz Szpruch, Gregory Epiphaniou, Kalina Staykova, Simran Singh, William Penwarden, Yisi Wen, Zijian Wang, Jagdish Hariharan, Pavle Avramovic

A further issue identified in this report is the systemic risk that AI can introduce to the financial sector.

Fairness, Fraud Detection

Insurance pricing on price comparison websites via reinforcement learning

no code implementations • 14 Aug 2023 • Tanut Treetanthiploet, Yufei Zhang, Lukasz Szpruch, Isaac Bowers-Barnard, Henrietta Ridley, James Hickey, Chris Pearce

The emergence of price comparison websites (PCWs) has presented insurers with unique challenges in formulating effective pricing strategies.

reinforcement-learning, Reinforcement Learning (RL)

TAPAS: a Toolbox for Adversarial Privacy Auditing of Synthetic Data

2 code implementations • 12 Nov 2022 • Florimond Houssiau, James Jordon, Samuel N. Cohen, Owen Daniel, Andrew Elliott, James Geddes, Callum Mole, Camila Rangel-Smith, Lukasz Szpruch

We here present TAPAS, a toolbox of attacks to evaluate synthetic data privacy under a wide range of scenarios.

Decision Making

Optimal scheduling of entropy regulariser for continuous-time linear-quadratic reinforcement learning

no code implementations • 8 Aug 2022 • Lukasz Szpruch, Tanut Treetanthiploet, Yufei Zhang

This work uses the entropy-regularised relaxed stochastic control perspective as a principled framework for designing reinforcement learning (RL) algorithms.

reinforcement-learning, Reinforcement Learning (RL), +1

Synthetic Data -- what, why and how?

no code implementations • 6 May 2022 • James Jordon, Lukasz Szpruch, Florimond Houssiau, Mirko Bottarelli, Giovanni Cherubin, Carsten Maple, Samuel N. Cohen, Adrian Weller

This explainer document aims to provide an overview of the current state of the rapidly expanding work on synthetic data technologies, with a particular focus on privacy.

Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime

no code implementations • 18 Jan 2022 • Bekzhan Kerimkulov, James-Michael Leahy, David Šiška, Lukasz Szpruch

We show that the objective function is increasing along the gradient flow.

Exploration-exploitation trade-off for continuous-time episodic reinforcement learning with linear-convex models

no code implementations • 19 Dec 2021 • Lukasz Szpruch, Tanut Treetanthiploet, Yufei Zhang

We develop a probabilistic framework for analysing model-based reinforcement learning in the episodic setting.

Model-based Reinforcement Learning, Reinforcement Learning (RL)

Sig-Wasserstein GANs for Time Series Generation

1 code implementation • 1 Nov 2021 • Hao Ni, Lukasz Szpruch, Marc Sabate-Vidales, Baoren Xiao, Magnus Wiese, Shujian Liao

Synthetic data is an emerging technology that can significantly accelerate the development and deployment of AI machine learning pipelines.

Time Series, Time Series Analysis, +1

Identifiability in inverse reinforcement learning

no code implementations • NeurIPS 2021 • Haoyang Cao, Samuel N. Cohen, Lukasz Szpruch

Inverse reinforcement learning attempts to reconstruct the reward function in a Markov decision problem, using observations of agent actions.

reinforcement-learning, Reinforcement Learning (RL)

Black-box model risk in finance

no code implementations • 9 Feb 2021 • Samuel N. Cohen, Derek Snow, Lukasz Szpruch

Machine learning models are increasingly used in a wide variety of financial settings.

BIG-bench Machine Learning, Management

Robust pricing and hedging via neural SDEs

1 code implementation • 8 Jul 2020 • Patryk Gierjatowicz, Marc Sabate-Vidales, David Šiška, Lukasz Szpruch, Žan Žurič

Combining neural networks with risk models based on classical stochastic differential equations (SDEs), we find robust bounds for prices of derivatives and the corresponding hedging strategies while incorporating relevant market data.

Model Selection

Conditional Sig-Wasserstein GANs for Time Series Generation

2 code implementations • 9 Jun 2020 • Shujian Liao, Hao Ni, Lukasz Szpruch, Magnus Wiese, Marc Sabate-Vidales, Baoren Xiao

The signature of a path is a graded sequence of statistics that provides a universal description for a stream of data, and its expected value characterises the law of the time-series model.

Time Series, Time Series Analysis, +1

Sig-SDEs model for quantitative finance

no code implementations • 30 May 2020 • Imanol Perez Arribas, Cristopher Salvi, Lukasz Szpruch

Mathematical models, calibrated to data, have become ubiquitous to make key decision processes in modern quantitative finance.

Model Selection, Time Series, +1

Mean-Field Langevin Dynamics and Energy Landscape of Neural Networks

no code implementations • 19 May 2019 • Kaitong Hu, Zhenjie Ren, David Šiška, Lukasz Szpruch

Our work is motivated by a desire to study the theoretical underpinning for the convergence of stochastic gradient type algorithms widely used for non-convex learning tasks such as training of neural networks.

Unbiased deep solvers for linear parametric PDEs

2 code implementations • 11 Oct 2018 • Marc Sabate Vidales, David Šiška, Lukasz Szpruch

We develop several deep learning algorithms for approximating families of parametric PDE solutions.

Non-asymptotic bounds for sampling algorithms without log-concavity

no code implementations • 21 Aug 2018 • Mateusz B. Majka, Aleksandar Mijatović, Lukasz Szpruch

Finally, we provide a weak convergence analysis that covers both the standard and the randomised (inaccurate) drift case.

Multilevel Monte Carlo for Scalable Bayesian Computations

no code implementations • 15 Sep 2016 • Mike Giles, Tigran Nagapetyan, Lukasz Szpruch, Sebastian Vollmer, Konstantinos Zygalakis

In contrast to MCMC methods, Stochastic Gradient MCMC (SGMCMC) algorithms such as the Stochastic Gradient Langevin Dynamics (SGLD) only require access to a batch of the data set at every step.

Multilevel Monte Carlo methods for the approximation of invariant measures of stochastic differential equations

no code implementations • 4 May 2016 • Michael B. Giles, Mateusz B. Majka, Lukasz Szpruch, Sebastian Vollmer, Konstantinos Zygalakis

We show that this is the first stochastic gradient MCMC method with complexity $\mathcal{O}(\varepsilon^{-2}|\log {\varepsilon}|^{3})$, in contrast to the complexity $\mathcal{O}(\varepsilon^{-3})$ of currently available methods.
