Search Results for author: Abraham Sanders

Found 3 papers, 2 papers with code

Bergeron: Combating Adversarial Attacks through a Conscience-Based Alignment Framework

1 code implementation • 16 Nov 2023 • Matthew Pisano, Peter Ly, Abraham Sanders, Bingsheng Yao, Dakuo Wang, Tomek Strzalkowski, Mei Si

To help mitigate this issue, we introduce Bergeron: a framework designed to improve the robustness of LLMs against attacks without any additional parameter fine-tuning.

Towards a Progression-Aware Autonomous Dialogue Agent

no code implementations • NAACL 2022 • Abraham Sanders, Tomek Strzalkowski, Mei Si, Albert Chang, Deepanshu Dey, Jonas Braasch, Dakuo Wang

Recent advances in large-scale language modeling and generation have enabled the creation of dialogue agents that exhibit human-like responses in a wide range of conversational scenarios spanning a diverse set of tasks, from general chit-chat to focused goal-oriented discourse.

Language Modelling

Should we tweet this? Generative response modeling for predicting reception of public health messaging on Twitter

1 code implementation • 9 Apr 2022 • Abraham Sanders, Debjani Ray-Majumder, John S. Erickson, Kristin P. Bennett

The way people respond to messaging from public health organizations on social media can provide insight into public perceptions on critical health issues, especially during a global crisis such as COVID-19.
