Search Results for author: Yixin Tan

Found 6 papers, 2 papers with code

More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling

1 code implementation18 Jun 2024 Haque Ishfaq, Yixin Tan, Yu Yang, Qingfeng Lan, Jianfeng Lu, A. Rupam Mahmood, Doina Precup, Pan Xu

Empirically, we show that in tasks where deep exploration is necessary, our proposed algorithms that combine FGTS and approximate sampling perform significantly better compared to other strong baselines.

reinforcement-learning Reinforcement Learning (RL) +1

Variational Schrödinger Diffusion Models

no code implementations8 May 2024 Wei Deng, Weijian Luo, Yixin Tan, Marin Biloš, Yu Chen, Yuriy Nevmyvaka, Ricky T. Q. Chen

To improve the scalability while preserving efficient transportation plans, we leverage variational inference to linearize the forward score functions (variational scores) of SB and restore simulation-free properties in training backward scores.

Variational Inference

Convergence of flow-based generative models via proximal gradient descent in Wasserstein space

1 code implementation26 Oct 2023 Xiuyuan Cheng, Jianfeng Lu, Yixin Tan, Yao Xie

Leveraging the exponential convergence of the proximal gradient descent (GD) in Wasserstein space, we prove the Kullback-Leibler (KL) guarantee of data generation by a JKO flow model to be $O(\varepsilon^2)$ when using $N \lesssim \log (1/\varepsilon)$ many JKO steps ($N$ Residual Blocks in the flow) where $\varepsilon $ is the error in the per-step first-order condition.

Convergence of score-based generative modeling for general data distributions

no code implementations26 Sep 2022 Holden Lee, Jianfeng Lu, Yixin Tan

Score-based generative modeling (SGM) has grown to be a hugely successful method for learning to generate samples from complex data distributions such as that of images and audio.

Denoising

Convergence for score-based generative modeling with polynomial complexity

no code implementations13 Jun 2022 Holden Lee, Jianfeng Lu, Yixin Tan

Using our guarantee, we give a theoretical analysis of score-based generative modeling, which transforms white-noise input into samples from a learned data distribution given score estimates at different noise scales.

Cannot find the paper you are looking for? You can Submit a new open access paper.