Search Results for author: Xuefeng Gao

Found 18 papers, 0 papers with code

Convergence Analysis for General Probability Flow ODEs of Diffusion Models in Wasserstein Distances

no code implementations31 Jan 2024 Xuefeng Gao, Lingjiong Zhu

Score-based generative modeling with probability flow ordinary differential equations (ODEs) has achieved remarkable success in a variety of applications.

Wasserstein Convergence Guarantees for a General Class of Score-Based Generative Models

no code implementations18 Nov 2023 Xuefeng Gao, Hoang M. Nguyen, Lingjiong Zhu

We find that the experimental results are in good agreement with our theoretical predictions on the iteration complexity, and the models with our newly proposed forward processes can outperform existing models.

Image Generation Unconditional Image Generation

Regret Bounds for Markov Decision Processes with Recursive Optimized Certainty Equivalents

no code implementations30 Jan 2023 Wenhao Xu, Xuefeng Gao, Xuedong He

The optimized certainty equivalent (OCE) is a family of risk measures that cover important examples such as entropic risk, conditional value-at-risk and mean-variance models.

reinforcement-learning Reinforcement Learning (RL)

Square-root regret bounds for continuous-time episodic Markov decision processes

no code implementations3 Oct 2022 Xuefeng Gao, Xun Yu Zhou

We study reinforcement learning for continuous-time Markov decision processes (MDPs) in the finite-horizon episodic setting.

reinforcement-learning Reinforcement Learning (RL)

Logarithmic regret bounds for continuous-time average-reward Markov decision processes

no code implementations23 May 2022 Xuefeng Gao, Xun Yu Zhou

We consider reinforcement learning for continuous-time Markov decision processes (MDPs) in the infinite-horizon, average-reward setting.

Point Processes reinforcement-learning +1

Debiasing Samples from Online Learning Using Bootstrap

no code implementations31 Jul 2021 Ningyuan Chen, Xuefeng Gao, Yi Xiong

It has been recently shown in the literature that the sample averages from online learning experiments are biased when used to estimate the mean reward.

Off-policy evaluation Thompson Sampling

Sublinear Regret for Learning POMDPs

no code implementations8 Jul 2021 Yi Xiong, Ningyuan Chen, Xuefeng Gao, Xiang Zhou

We study the model-based undiscounted reinforcement learning for partially observable Markov decision processes (POMDPs).

reinforcement-learning Reinforcement Learning (RL)

Breaking Reversibility Accelerates Langevin Dynamics for Non-Convex Optimization

no code implementations NeurIPS 2020 Xuefeng Gao, Mert Gurbuzbalaban, Lingjiong Zhu

We study two variants that are based on non-reversible Langevin diffusions: the underdamped Langevin dynamics (ULD) and the Langevin dynamics with a non-symmetric drift (NLD).

State-Dependent Temperature Control for Langevin Diffusions

no code implementations15 Nov 2020 Xuefeng Gao, Zuo Quan Xu, Xun Yu Zhou

We study the temperature control problem for Langevin diffusions in the context of non-convex optimization.

Decentralized Stochastic Gradient Langevin Dynamics and Hamiltonian Monte Carlo

no code implementations1 Jul 2020 Mert Gürbüzbalaban, Xuefeng Gao, Yuanhan Hu, Lingjiong Zhu

Stochastic gradient Langevin dynamics (SGLD) and stochastic gradient Hamiltonian Monte Carlo (SGHMC) are two popular Markov Chain Monte Carlo (MCMC) algorithms for Bayesian inference that can scale to large datasets, allowing to sample from the posterior distribution of the parameters of a statistical model given the input data and the prior distribution over the model parameters.

Bayesian Inference regression

Non-Convex Optimization via Non-Reversible Stochastic Gradient Langevin Dynamics

no code implementations6 Apr 2020 Yuanhan Hu, Xiaoyu Wang, Xuefeng Gao, Mert Gurbuzbalaban, Lingjiong Zhu

In this paper, we study the non reversible Stochastic Gradient Langevin Dynamics (NSGLD) which is based on discretization of the non-reversible Langevin diffusion.

Stochastic Optimization

Regime Switching Bandits

no code implementations NeurIPS 2021 Xiang Zhou, Yi Xiong, Ningyuan Chen, Xuefeng Gao

We study a multi-armed bandit problem where the rewards exhibit regime switching.

Breaking Reversibility Accelerates Langevin Dynamics for Global Non-Convex Optimization

no code implementations19 Dec 2018 Xuefeng Gao, Mert Gurbuzbalaban, Lingjiong Zhu

We study two variants that are based on non-reversible Langevin diffusions: the underdamped Langevin dynamics (ULD) and the Langevin dynamics with a non-symmetric drift (NLD).

Global Convergence of Stochastic Gradient Hamiltonian Monte Carlo for Non-Convex Stochastic Optimization: Non-Asymptotic Performance Bounds and Momentum-Based Acceleration

no code implementations12 Sep 2018 Xuefeng Gao, Mert Gürbüzbalaban, Lingjiong Zhu

We provide finite-time performance bounds for the global convergence of both SGHMC variants for solving stochastic non-convex optimization problems with explicit constants.

Stochastic Optimization

Optimal Market Making in the Presence of Latency

no code implementations15 Jun 2018 Xuefeng Gao, Yunhan Wang

This paper studies optimal market making for large-tick assets in the presence of latency.

Cannot find the paper you are looking for? You can Submit a new open access paper.