Search Results for author: Guodong Zheng

Found 5 papers, 3 papers with code

A Unified Analysis for Finite Weight Averaging

no code implementations20 Nov 2024 Peng Wang, Li Shen, Zerui Tao, Yan Sun, Guodong Zheng, DaCheng Tao

In this work, we first generalize SGD and LAWA as Finite Weight Averaging (FWA) and explain their advantages compared to SGD from the perspective of optimization and generalization.

Mathematical Induction

RMB: Comprehensively Benchmarking Reward Models in LLM Alignment

1 code implementation13 Oct 2024 Enyu Zhou, Guodong Zheng, Binghai Wang, Zhiheng Xi, Shihan Dou, Rong Bao, Wei Shen, Limao Xiong, Jessica Fan, Yurong Mou, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang

However, the current evaluation of RMs may not directly correspond to their alignment performance due to the limited distribution of evaluation data and evaluation methods that are not closely related to alignment objectives.

Benchmarking

Cannot find the paper you are looking for? You can Submit a new open access paper.