Search Results for author: Su Dong

Found 1 papers, 0 papers with code

Benchmarking Large Language Models via Random Variables

no code implementations20 Jan 2025 Zijin Hong, Hao Wu, Su Dong, Junnan Dong, Yilin Xiao, Yujing Zhang, Zhu Wang, Feiran Huang, Linyi Li, Hongxia Yang, Xiao Huang

LLMs must fully understand the problem-solving process for the original problem to correctly answer RV questions with various combinations of variable values.

Benchmarking Mathematical Reasoning

Cannot find the paper you are looking for? You can Submit a new open access paper.