Search Results

LLM Post-Training: A Deep Dive into Reasoning Large Language Models

1 code implementation28 Feb 2025

Large Language Models (LLMs) have transformed the natural language processing landscape and brought to life diverse applications.

LLMs as Hackers: Autonomous Linux Privilege Escalation Attacks

1 code implementation17 Oct 2023

We explore the intersection of LLMs and penetration testing to gain insight into their capabilities and challenges in the context of privilege escalation.

In-Context Learning

CIPHER: Cybersecurity Intelligent Penetration-testing Helper for Ethical Researcher

1 code implementation21 Aug 2024

Additionally, we introduced the Findings, Action, Reasoning, and Results (FARR) Flow augmentation, a novel method to augment penetration testing write-ups to establish a fully automated pentesting simulation benchmark tailored for large language models.

Language Modelling Large Language Model

The Agent Web Model -- Modelling web hacking for reinforcement learning

1 code implementation23 Sep 2020

Website hacking is a frequent attack type used by malicious actors to obtain confidential information, modify the integrity of web pages or make websites unavailable.

Cryptography and Security

Towards Automated Penetration Testing: Introducing LLM Benchmark, Analysis, and Improvements

1 code implementation22 Oct 2024

We first evaluate the performance of LLMs, including GPT-4o and LLama 3. 1-405B, using the state-of-the-art PentestGPT tool.

Machine Learning Featurizations for AI Hacking of Political Systems

1 code implementation8 Oct 2021

What would the inputs be to a machine whose output is the destabilization of a robust democracy, or whose emanations could disrupt the political power of nations?

BIG-bench Machine Learning

PenTest++: Elevating Ethical Hacking with AI and Automation

no code implementations13 Feb 2025

Traditional ethical hacking relies on skilled professionals and time-intensive command management, which limits its scalability and efficiency.

Decision Making

AI-Enhanced Ethical Hacking: A Linux-Focused Experiment

no code implementations7 Oct 2024

This technical report investigates the integration of generative AI (GenAI), specifically ChatGPT, into the practice of ethical hacking through a comprehensive experimental study and conceptual analysis.

Hallucination

AI-Augmented Ethical Hacking: A Practical Examination of Manual Exploitation and Privilege Escalation in Linux Environments

no code implementations26 Nov 2024

This study explores the application of generative AI (GenAI) within manual exploitation and privilege escalation tasks in Linux-based penetration testing environments, two areas critical to comprehensive cybersecurity assessments.

Decision Making

Cannot find the paper you are looking for? You can Submit a new open access paper.