Search Results for author: Peter Ly

Found 1 papers, 1 papers with code

Bergeron: Combating Adversarial Attacks through a Conscience-Based Alignment Framework

1 code implementation • 16 Nov 2023 • Matthew Pisano, Peter Ly, Abraham Sanders, Bingsheng Yao, Dakuo Wang, Tomek Strzalkowski, Mei Si

To help mitigate this issue, we introduce Bergeron: a framework designed to improve the robustness of LLMs against attacks without any additional parameter fine-tuning.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.