Search Results for author: Renyuan Xu

Found 21 papers, 1 papers with code

Neural Network-Based Score Estimation in Diffusion Models: Optimization and Generalization

no code implementations28 Jan 2024 Yinbin Han, Meisam Razaviyayn, Renyuan Xu

Our analysis is grounded in a novel parametric form of the neural network and an innovative connection between score matching and regression analysis, facilitating the application of advanced statistical and optimization techniques.

Denoising regression

Fast Policy Learning for Linear Quadratic Control with Entropy Regularization

no code implementations23 Nov 2023 Xin Guo, Xinyu Li, Renyuan Xu

This paper proposes and analyzes two new policy learning methods: regularized policy gradient (RPG) and iterative policy optimization (IPO), for a class of discounted linear-quadratic control (LQC) problems over an infinite time horizon with entropy regularization.

Risk-sensitive Markov Decision Process and Learning under General Utility Functions

no code implementations22 Nov 2023 Zhengqi Wu, Renyuan Xu

In this paper, we consider a scenario where the decision-maker seeks to optimize a general utility function of the cumulative reward in the framework of a Markov decision process (MDP).

Reinforcement Learning (RL)

Policy Gradient Converges to the Globally Optimal Policy for Nearly Linear-Quadratic Regulators

no code implementations15 Mar 2023 Yinbin Han, Meisam Razaviyayn, Renyuan Xu

Nonlinear control systems with partial information to the decision maker are prevalent in a variety of applications.

Asymptotic Analysis of Deep Residual Networks

no code implementations15 Dec 2022 Rama Cont, Alain Rossier, Renyuan Xu

We investigate the asymptotic properties of deep Residual networks (ResNets) as the number of layers increases.

Risk-Aware Linear Bandits: Theory and Applications in Smart Order Routing

no code implementations4 Aug 2022 Jingwei Ji, Renyuan Xu, Ruihao Zhu

Then, we rigorously analyze their near-optimal regret upper bounds to show that, by leveraging the linear structure, our algorithms can dramatically reduce the regret when compared to existing methods.

Decision Making

Convergence and Implicit Regularization Properties of Gradient Descent for Deep Residual Networks

no code implementations14 Apr 2022 Rama Cont, Alain Rossier, Renyuan Xu

We prove linear convergence of gradient descent to a global optimum for the training of deep residual networks with constant layer width and smooth activation function.

Tail-GAN: Learning to Simulate Tail Risk Scenarios

no code implementations3 Mar 2022 Rama Cont, Mihai Cucuringu, Renyuan Xu, Chao Zhang

The estimation of loss distributions for dynamic portfolios requires the simulation of scenarios representing realistic joint dynamics of their components, with particular importance devoted to the simulation of tail risk scenarios.

Generative Adversarial Network

Recent Advances in Reinforcement Learning in Finance

no code implementations8 Dec 2021 Ben Hambly, Renyuan Xu, Huining Yang

In contrast to classical stochastic control theory and other analytical approaches for solving financial decision-making problems that heavily reply on model assumptions, new developments from reinforcement learning (RL) are able to make full use of the large amount of financial data with fewer model assumptions and to improve decisions in complex financial environments.

Decision Making Portfolio Optimization +2

Policy Gradient Methods Find the Nash Equilibrium in N-player General-sum Linear-quadratic Games

no code implementations27 Jul 2021 Ben Hambly, Renyuan Xu, Huining Yang

We consider a general-sum N-player linear-quadratic game with stochastic dynamics over a finite horizon and prove the global convergence of the natural policy gradient method to the Nash equilibrium.

Policy Gradient Methods

Scaling Properties of Deep Residual Networks

1 code implementation25 May 2021 Alain-Sam Cohen, Rama Cont, Alain Rossier, Renyuan Xu

Residual networks (ResNets) have displayed impressive results in pattern recognition and, recently, have garnered considerable theoretical interest due to a perceived link with neural ordinary differential equations (neural ODEs).

Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon

no code implementations20 Nov 2020 Ben Hambly, Renyuan Xu, Huining Yang

In particular, we consider the convergence of policy gradient methods in the setting of known and unknown parameters.

Policy Gradient Methods

Model-free Analysis of Dynamic Trading Strategies

no code implementations5 Nov 2020 Anna Ananova, Rama Cont, Renyuan Xu

We introduce a model-free approach based on excursions of trading signals for analyzing the risk and return for a broad class of dynamic trading strategies, including pairs trading and other statistical arbitrage strategies.

A General Framework for Learning Mean-Field Games

no code implementations13 Mar 2020 Xin Guo, Anran Hu, Renyuan Xu, Junzi Zhang

This paper presents a general mean-field game (GMFG) framework for simultaneous learning and decision-making in stochastic games with a large population.

Decision Making Multi-agent Reinforcement Learning +3

Delay-Adaptive Learning in Generalized Linear Contextual Bandits

no code implementations11 Mar 2020 Jose Blanchet, Renyuan Xu, Zhengyuan Zhou

In this paper, we consider online learning in generalized linear contextual bandits where rewards are not immediately observed.

Multi-Armed Bandits Thompson Sampling

Mean-Field Controls with Q-learning for Cooperative MARL: Convergence and Complexity Analysis

no code implementations10 Feb 2020 Haotian Gu, Xin Guo, Xiaoli Wei, Renyuan Xu

Multi-agent reinforcement learning (MARL), despite its popularity and empirical success, suffers from the curse of dimensionality.

Multi-agent Reinforcement Learning Q-Learning

Learning in Generalized Linear Contextual Bandits with Stochastic Delays

no code implementations NeurIPS 2019 Zhengyuan Zhou, Renyuan Xu, Jose Blanchet

In this paper, we consider online learning in generalized linear contextual bandits where rewards are not immediately observed.

Multi-Armed Bandits

Transaction Cost Analytics for Corporate Bonds

no code implementations21 Mar 2019 Xin Guo, Charles-Albert Lehalle, Renyuan Xu

This part is on the time scale of each transaction of liquid corporate bonds, and is by applying a transient impact model to estimate the price impact kernel using a non-parametric method.

A class of stochastic games and moving free boundary problems

no code implementations10 Sep 2018 Xin Guo, Wenpin Tang, Renyuan Xu

In this paper we propose and analyze a class of $N$-player stochastic games that include finite fuel stochastic games as a special case.

Cannot find the paper you are looking for? You can Submit a new open access paper.