Search Results for author: Huizhuo Yuan

Found 9 papers, 1 papers with code

Policy Optimization via Stochastic Recursive Gradient Algorithm

no code implementations ICLR 2019 Huizhuo Yuan, Chris Junchi Li, Yuhao Tang, Yuren Zhou

In this paper, we propose the StochAstic Recursive grAdient Policy Optimization (SARAPO) algorithm which is a novel variance reduction method on Trust Region Policy Optimization (TRPO).

Protein Conformation Generation via Force-Guided SE(3) Diffusion Models

no code implementations21 Mar 2024 Yan Wang, Lihao Wang, Yuning Shen, Yiqun Wang, Huizhuo Yuan, Yue Wu, Quanquan Gu

The conformational landscape of proteins is crucial to understanding their functionality in complex biological processes.

Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation

no code implementations15 Feb 2024 Huizhuo Yuan, Zixiang Chen, Kaixuan Ji, Quanquan Gu

Fine-tuning Diffusion Models remains an underexplored frontier in generative artificial intelligence (GenAI), especially when compared with the remarkable progress made in fine-tuning Large Language Models (LLMs).

Reinforcement Learning (RL) Text-to-Image Generation

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

2 code implementations2 Jan 2024 Zixiang Chen, Yihe Deng, Huizhuo Yuan, Kaixuan Ji, Quanquan Gu

In this paper, we delve into the prospect of growing a strong LLM out of a weak one without the need for acquiring additional human-annotated data.

Fast Sampling via De-randomization for Discrete Diffusion Models

no code implementations14 Dec 2023 Zixiang Chen, Huizhuo Yuan, YongQian Li, Yiwen Kou, Junkai Zhang, Quanquan Gu

Despite its success in continuous spaces, discrete diffusion models, which apply to domains such as texts and natural languages, remain under-studied and often suffer from slow generation speed.

Image Generation Machine Translation +1

Stochastic Recursive Momentum for Policy Gradient Methods

no code implementations9 Mar 2020 Huizhuo Yuan, Xiangru Lian, Ji Liu, Yuren Zhou

In this paper, we propose a novel algorithm named STOchastic Recursive Momentum for Policy Gradient (STORM-PG), which operates a SARAH-type stochastic recursive variance-reduced policy gradient in an exponential moving average fashion.

Policy Gradient Methods

Stochastic Modified Equations for Continuous Limit of Stochastic ADMM

no code implementations7 Mar 2020 Xiang Zhou, Huizhuo Yuan, Chris Junchi Li, Qingyun Sun

In this work, we put different variants of stochastic ADMM into a unified form, which includes standard, linearized and gradient-based ADMM with relaxation, and study their dynamics via a continuous-time model approach.

Stochastic Recursive Variance Reduction for Efficient Smooth Non-Convex Compositional Optimization

no code implementations31 Dec 2019 Huizhuo Yuan, Xiangru Lian, Ji Liu

Such a complexity is known to be the best one among IFO complexity results for non-convex stochastic compositional optimization, and is believed to be optimal.

Management Stochastic Optimization

Cannot find the paper you are looking for? You can Submit a new open access paper.