1 code implementation • 15 Nov 2023 • Zhexin Zhang, Junxiao Yang, Pei Ke, Minlie Huang
We hope our work could contribute to the comprehension of jailbreaking attacks and defenses, and shed light on the relationship between LLMs' capability and safety.