no code implementations • 30 Dec 2024 • Pengfei Jing, Mengyun Tang, Xiaorong Shi, Xing Zheng, Sen Nie, Shi Wu, Yong Yang, Xiapu Luo
To address these gaps, we propose SecBench, a multi-dimensional benchmarking dataset designed to evaluate LLMs in the cybersecurity domain.
3 code implementations • 13 Nov 2024 • Yingqi Gao, Yifu Liu, Xiaoxia Li, Xiaorong Shi, Yin Zhu, Yiming Wang, Shiqi Li, Wei Li, Yuntao Hong, Zhiling Luo, Jinyang Gao, Liyu Mou, Yu Li
On the other hand, we implement the ICL approach with an example selection method based on named entity recognition to prevent overemphasis on entities.
Ranked #1 on Text-To-SQL on Spider
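The excerpt above mentions selecting in-context examples with named entity recognition so that entities are not overemphasized. A minimal sketch of that general idea follows; it is not the authors' implementation. It assumes entities are masked before similarity is computed, uses a toy rule-based masker in place of a real NER model, and uses Jaccard overlap as the similarity measure. The names `mask_entities`, `jaccard`, `select_examples`, and the example pool are hypothetical.

```python
import re


def mask_entities(question: str) -> list[str]:
    """Toy entity masker (assumption: stands in for a real NER model).

    Numbers and capitalized words that are not sentence-initial are
    replaced with a placeholder; everything else is lowercased.
    """
    tokens = re.findall(r"[A-Za-z]+|\d+", question)
    masked = []
    for i, tok in enumerate(tokens):
        is_entity = tok.isdigit() or (i > 0 and tok[0].isupper())
        masked.append("<ent>" if is_entity else tok.lower())
    return masked


def jaccard(a: list[str], b: list[str]) -> float:
    """Set overlap between two token lists (assumed similarity measure)."""
    sa, sb = set(a), set(b)
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0


def select_examples(question: str, pool: list[dict], k: int = 1) -> list[dict]:
    """Rank candidate examples by similarity of their entity-masked questions."""
    target = mask_entities(question)
    ranked = sorted(
        pool,
        key=lambda ex: jaccard(target, mask_entities(ex["question"])),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical example pool for illustration only.
pool = [
    {
        "question": "How many singers are from France?",
        "sql": "SELECT count(*) FROM singer WHERE country = 'France'",
    },
    {
        "question": "List the names of stadiums in France ordered by capacity",
        "sql": "SELECT name FROM stadium WHERE country = 'France' ORDER BY capacity",
    },
]

best = select_examples("How many students are from Canada?", pool, k=1)
```

Because "France" and "Canada" both collapse to the same placeholder, the ranking here is driven by question structure ("how many ... are from ...") rather than by which entity happens to appear.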