Search Results for author: Yulai Zhao

Found 7 papers, 1 paper with code

Feedback Efficient Online Fine-Tuning of Diffusion Models

no code implementations • 26 Feb 2024 • Masatoshi Uehara, Yulai Zhao, Kevin Black, Ehsan Hajiramezanali, Gabriele Scalia, Nathaniel Lee Diamant, Alex M Tseng, Sergey Levine, Tommaso Biancalani

It is natural to frame this as a reinforcement learning (RL) problem, in which the objective is to fine-tune a diffusion model to maximize a reward function that corresponds to some property.

reinforcement-learning, Reinforcement Learning (RL)

Local Optimization Achieves Global Optimality in Multi-Agent Reinforcement Learning

1 code implementation • 8 May 2023 • Yulai Zhao, Zhuoran Yang, Zhaoran Wang, Jason D. Lee

Motivated by the observation, we present a multi-agent PPO algorithm in which the local policy of each agent is updated similarly to vanilla PPO.

LEMMA, Multi-agent Reinforcement Learning, +1

Blessing of Class Diversity in Pre-training

no code implementations • 7 Sep 2022 • Yulai Zhao, Jianshu Chen, Simon S. Du

Here, $n$ is the number of pre-training data and $m$ is the number of data in the downstream task, and typically $n \gg m$.

Language Modelling, Transfer Learning

Optimizing the Performative Risk under Weak Convexity Assumptions

no code implementations • 2 Sep 2022 • Yulai Zhao

The core difficulty of using the performative risk as an optimization objective is that the data distribution itself depends on the model parameters.

Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games

no code implementations • 17 Feb 2021 • Yulai Zhao, Yuandong Tian, Jason D. Lee, Simon S. Du

Policy-based methods with function approximation are widely used for solving two-player zero-sum games with large state and/or action spaces.

Policy Gradient Methods, Vocal Bursts Valence Prediction
