Search Results for author: Wenjing Luo

Found 4 papers, 3 papers with code

SysBench: Can Large Language Models Follow System Messages?

1 code implementation20 Aug 2024 Yanzhao Qin, Tao Zhang, Yanjun Shen, Wenjing Luo, Haoze Sun, Yan Zhang, Yujing Qiao, WeiPeng Chen, Zenan Zhou, Wentao Zhang, Bin Cui

Finally, we conduct extensive evaluation across various existing LLMs, measuring their ability to follow specified constraints given in system messages.

CFBench: A Comprehensive Constraints-Following Benchmark for LLMs

1 code implementation2 Aug 2024 Chenglin Zhu, Yanjun Shen, Wenjing Luo, Yan Zhang, Hao Liang, Tao Zhang, Fan Yang, MingAn Lin, Yujing Qiao, WeiPeng Chen, Bin Cui, Wentao Zhang, Zenan Zhou

The adeptness of Large Language Models (LLMs) in comprehending and following natural language instructions is critical for their deployment in sophisticated real-world applications.

Cannot find the paper you are looking for? You can Submit a new open access paper.