Search Results for author: Mark Purcell

Found 15 papers, 4 papers with code

MAD-MAX: Modular And Diverse Malicious Attack MiXtures for Automated LLM Red Teaming

no code implementations8 Mar 2025 Stefan Schoepf, Muhammad Zaid Hameed, Ambrish Rawat, Kieran Fraser, Giulio Zizzo, Giandomenico Cornacchia, Mark Purcell

The MAD-MAX approach is designed to be easily extensible with newly discovered attack strategies and outperforms the prominent Red Teaming method Tree of Attacks with Pruning (TAP) significantly in terms of Attack Success Rate (ASR) and queries needed to achieve jailbreaks.

Red Teaming

Towards Assurance of LLM Adversarial Robustness using Ontology-Driven Argumentation

no code implementations10 Oct 2024 Beat Buesser, Giulio Zizzo, Mark Purcell, Tomas Bueno Momcilovic

Despite the impressive adaptability of large language models (LLMs), challenges remain in ensuring their security, transparency, and interpretability.

Adversarial Robustness Code Translation

Towards Assuring EU AI Act Compliance and Adversarial Robustness of LLMs

no code implementations4 Oct 2024 Tomas Bueno Momcilovic, Beat Buesser, Giulio Zizzo, Mark Purcell, Dian Balta

Large language models are prone to misuse and vulnerable to security threats, raising significant safety and security concerns.

Adversarial Robustness

Developing Assurance Cases for Adversarial Robustness and Regulatory Compliance in LLMs

no code implementations4 Oct 2024 Tomas Bueno Momcilovic, Dian Balta, Beat Buesser, Giulio Zizzo, Mark Purcell

This paper presents an approach to developing assurance cases for adversarial robustness and regulatory compliance in large language models (LLMs).

Adversarial Robustness Management

Knowledge-Augmented Reasoning for EUAIA Compliance and Adversarial Robustness of LLMs

no code implementations4 Oct 2024 Tomas Bueno Momcilovic, Dian Balta, Beat Buesser, Giulio Zizzo, Mark Purcell

The EU AI Act (EUAIA) introduces requirements for AI systems which intersect with the processes required to establish adversarial robustness.

Adversarial Robustness

MoJE: Mixture of Jailbreak Experts, Naive Tabular Classifiers as Guard for Prompt Attacks

no code implementations26 Sep 2024 Giandomenico Cornacchia, Giulio Zizzo, Kieran Fraser, Muhammad Zaid Hameed, Ambrish Rawat, Mark Purcell

The proliferation of Large Language Models (LLMs) in diverse applications underscores the pressing need for robust security measures to thwart potential jailbreak attacks.

Computational Efficiency

Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAI

no code implementations23 Sep 2024 Ambrish Rawat, Stefan Schoepf, Giulio Zizzo, Giandomenico Cornacchia, Muhammad Zaid Hameed, Kieran Fraser, Erik Miehling, Beat Buesser, Elizabeth M. Daly, Mark Purcell, Prasanna Sattigeri, Pin-Yu Chen, Kush R. Varshney

As generative AI, particularly large language models (LLMs), become increasingly integrated into production applications, new attack surfaces and vulnerabilities emerge and put a focus on adversarial threats in natural language and multi-modal systems.

Red Teaming

Knowledge- and Data-driven Services for Energy Systems using Graph Neural Networks

no code implementations12 Mar 2021 Francesco Fusco, Bradley Eck, Robert Gormally, Mark Purcell, Seshu Tirupathi

The transition away from carbon-based energy sources poses several challenges for the operation of electricity distribution systems.

Decision Making Physical Simulations

Scalable Deployment of AI Time-series Models for IoT

no code implementations24 Mar 2020 Bradley Eck, Francesco Fusco, Robert Gormally, Mark Purcell, Seshu Tirupathi

A knowledge-based approach to managing model and time-series data allows the use of general semantic concepts for expressing feature engineering tasks.

Cloud Computing Feature Engineering +2

Castor: Contextual IoT Time Series Data and Model Management at Scale

1 code implementation20 Nov 2018 Bei Chen, Bradley Eck, Francesco Fusco, Robert Gormally, Mark Purcell, Mathieu Sinn, Seshu Tirupathi

The main features of Castor are: (1) an efficient pipeline for ingesting IoT time series data in real time; (2) a scalable, hybrid data management service for both time series and contextual data; (3) a versatile semantic model for contextual information which can be easily adopted to different application domains; (4) an abstract framework for developing and storing predictive models in R or Python; (5) deployment services which automatically train and/or score predictive models upon user-defined conditions.

Computation Other Statistics

Cannot find the paper you are looking for? You can Submit a new open access paper.