Search Results for author: Hyrum Anderson

Found 5 papers, 2 papers with code

Metadata-Based Detection of Child Sexual Abuse Material

no code implementations • 5 Oct 2020 • Mayana Pereira, Rahul Dodhia, Hyrum Anderson, Richard Brown

With such restrictions in place, the development of CSAM machine learning detection systems based on file metadata uncovers several opportunities.

BIG-bench Machine Learning

Tree of Attacks: Jailbreaking Black-Box LLMs Automatically

1 code implementation • 4 Dec 2023 • Anay Mehrotra, Manolis Zampetakis, Paul Kassianik, Blaine Nelson, Hyrum Anderson, Yaron Singer, Amin Karbasi

In this work, we present Tree of Attacks with Pruning (TAP), an automated method for generating jailbreaks that only requires black-box access to the target LLM.
