Search Results for author: Yurong Mou

Found 4 papers, 2 papers with code

RMB: Comprehensively Benchmarking Reward Models in LLM Alignment

1 code implementation13 Oct 2024 Enyu Zhou, Guodong Zheng, Binghai Wang, Zhiheng Xi, Shihan Dou, Rong Bao, Wei Shen, Limao Xiong, Jessica Fan, Yurong Mou, Rui Zheng, Tao Gui, Qi Zhang, Xuanjing Huang

However, the current evaluation of RMs may not directly correspond to their alignment performance due to the limited distribution of evaluation data and evaluation methods that are not closely related to alignment objectives.

Benchmarking

Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning

1 code implementation5 May 2024 Jun Zhao, Jingqi Tong, Yurong Mou, Ming Zhang, Qi Zhang, Xuanjing Huang

In this work, we investigate the compositionality of large language models (LLMs) in mathematical reasoning.

GSM8K Math +1

Cannot find the paper you are looking for? You can Submit a new open access paper.