Search Results for author: Tanay Wakhare

Found 1 paper, 1 paper with code

Prompts have evil twins

1 code implementation · 13 Nov 2023 · Rimon Melamed, Lucas H. McCabe, Tanay Wakhare, Yejin Kim, H. Howie Huang, Enric Boix-Adsera

We discover that many natural-language prompts can be replaced by corresponding prompts that are unintelligible to humans but that provably elicit similar behavior in language models.
