Search Results for author: Rohan Bindu

Found 3 papers, 0 papers with code

LLM Agents can Autonomously Exploit One-day Vulnerabilities

no code implementations11 Apr 2024 Richard Fang, Rohan Bindu, Akul Gupta, Daniel Kang

In this work, we show that LLM agents can autonomously exploit one-day vulnerabilities in real-world systems.

GPT-3.5 GPT-4

LLM Agents can Autonomously Hack Websites

no code implementations6 Feb 2024 Richard Fang, Rohan Bindu, Akul Gupta, Qiusi Zhan, Daniel Kang

However, not much is known about the offensive capabilities of LLM agents.

GPT-4

Removing RLHF Protections in GPT-4 via Fine-Tuning

no code implementations9 Nov 2023 Qiusi Zhan, Richard Fang, Rohan Bindu, Akul Gupta, Tatsunori Hashimoto, Daniel Kang

In tandem, LLM vendors have been increasingly enabling fine-tuning of their most powerful models.

GPT-4

Cannot find the paper you are looking for? You can Submit a new open access paper.