Search Results for author: Nahema Marchal

Found 7 papers, 0 papers with code

Generative AI Misuse: A Taxonomy of Tactics and Insights from Real-World Data

no code implementations • 19 Jun 2024 • Nahema Marchal, Rachel Xu, Rasmi Elasmar, Iason Gabriel, Beth Goldberg, William Isaac

Generative, multimodal artificial intelligence (GenAI) offers transformative potential across industries, but its misuse poses significant risks.

STAR: SocioTechnical Approach to Red Teaming Language Models

no code implementations • 17 Jun 2024 • Laura Weidinger, John Mellor, Bernat Guillen Pegueroles, Nahema Marchal, Ravin Kumar, Kristian Lum, Canfer Akbulut, Mark Diaz, Stevie Bergman, Mikel Rodriguez, Verena Rieser, William Isaac

This research introduces STAR, a sociotechnical framework that improves on current best practices for red teaming safety of large language models.

Red Teaming
