Search Results for author: Xingwei Lin

Found 1 papers, 1 papers with code

GPTFUZZER: Red Teaming Large Language Models with Auto-Generated Jailbreak Prompts

1 code implementation19 Sep 2023 Jiahao Yu, Xingwei Lin, Zheng Yu, Xinyu Xing

Remarkably, GPTFuzz achieves over 90% attack success rates against ChatGPT and Llama-2 models, even with suboptimal initial seed templates.

Cannot find the paper you are looking for? You can Submit a new open access paper.