Search Results for author: Divij Handa

Found 2 papers, 0 papers with code

Jailbreaking Proprietary Large Language Models using Word Substitution Cipher

no code implementations • 16 Feb 2024 • Divij Handa, Advait Chirmule, Bimal Gajera, Chitta Baral

We first present a pilot study on the state-of-the-art LLM, GPT-4, in decoding several safe sentences that have been encrypted using various cryptographic techniques and find that a straightforward word substitution cipher can be decoded most effectively.

Paper
Add Code

Can NLP Models Correctly Reason Over Contexts that Break the Common Assumptions?

no code implementations • 20 May 2023 • Neeraj Varshney, Mihir Parmar, Nisarg Patel, Divij Handa, Sayantan Sarkar, Man Luo, Chitta Baral

Can state-of-the-art NLP models correctly reason over the contexts of such scenarios?

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.