Search Results for author: Zhenxuan Zhang

Found 1 paper, 0 papers with code

A Chinese Dataset for Evaluating the Safeguards in Large Language Models

no code implementations • 19 Feb 2024 • Yuxia Wang, Zenan Zhai, Haonan Li, Xudong Han, Lizhi Lin, Zhenxuan Zhang, Jingru Zhao, Preslav Nakov, Timothy Baldwin

Previous studies have proposed comprehensive taxonomies of the risks posed by LLMs, as well as corresponding prompts that can be used to examine the safety mechanisms of LLMs.
