Search Results for author: Chunwei Wu

Found 4 papers, 1 papers with code

LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models

no code implementations25 May 2025 Fengqi Zhu, Rongzhen Wang, Shen Nie, Xiaolu Zhang, Chunwei Wu, Jun Hu, Jun Zhou, Jianfei Chen, Yankai Lin, Ji-Rong Wen, Chongxuan Li

To address this issue, we propose Variance-Reduced Preference Optimization (VRPO), a framework that formally analyzes the variance of ELBO estimators and derives bounds on both the bias and variance of preference optimization gradients.

GSM8K HumanEval +3

Chaos to Order: A Label Propagation Perspective on Source-Free Domain Adaptation

no code implementations20 Jan 2023 Chunwei Wu, Guitao Cao, Yan Li, Xidong Xi, Wenming Cao, Hong Wang

Inspired by this insight, we present Chaos to Order (CtO), a novel approach for SFDA that strives to constrain semantic credibility and propagate label information among target subpopulations.

Clustering Source-Free Domain Adaptation

Cannot find the paper you are looking for? You can Submit a new open access paper.