Search Results for author: Wanqian Guo

Found 1 papers, 0 papers with code

EnviroExam: Benchmarking Environmental Science Knowledge of Large Language Models

no code implementations18 May 2024 Yu Huang, Liang Guo, Wanqian Guo, Zhe Tao, Yang Lv, Zhihao Sun, Dongfang Zhao

In the field of environmental science, it is crucial to have robust evaluation metrics for large language models to ensure their efficacy and accuracy.

Benchmarking Specificity

Cannot find the paper you are looking for? You can Submit a new open access paper.