Search Results for author: Zach Cameron

Found 1 papers, 0 papers with code

Don't Listen To Me: Understanding and Exploring Jailbreak Prompts of Large Language Models

no code implementations26 Mar 2024 Zhiyuan Yu, Xiaogeng Liu, Shunning Liang, Zach Cameron, Chaowei Xiao, Ning Zhang

Building on the insights from the user study, we also developed a system using AI as the assistant to automate the process of jailbreak prompt generation.

Cannot find the paper you are looking for? You can Submit a new open access paper.