Search Results for author: Zhenxuan Zhang

Found 1 papers, 0 papers with code

A Chinese Dataset for Evaluating the Safeguards in Large Language Models

no code implementations19 Feb 2024 Yuxia Wang, Zenan Zhai, Haonan Li, Xudong Han, Lizhi Lin, Zhenxuan Zhang, Jingru Zhao, Preslav Nakov, Timothy Baldwin

Previous studies have proposed comprehensive taxonomies of the risks posed by LLMs, as well as corresponding prompts that can be used to examine the safety mechanisms of LLMs.

Cannot find the paper you are looking for? You can Submit a new open access paper.