Search Results for author: Zhiwei Fei

Found 2 papers, 2 papers with code

MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark

1 code implementation • 20 May 2024 • Hongwei Liu, Zilong Zheng, Yuxuan Qiao, Haodong Duan, Zhiwei Fei, Fengzhe Zhou, Wenwei Zhang, Songyang Zhang, Dahua Lin, Kai Chen

MathBench spans a wide range of mathematical disciplines, offering a detailed evaluation of both theoretical understanding and practical problem-solving skills.

College Mathematics GSM8K +1

Paper
Code

LawBench: Benchmarking Legal Knowledge of Large Language Models

1 code implementation • 28 Sep 2023 • Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Songyang Zhang, Kai Chen, Zongwen Shen, Jidong Ge

We hope this benchmark provides in-depth understanding of the LLMs' domain-specified capabilities and speed up the development of LLMs in the legal domain.

Benchmarking Memorization +1

190

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.