Search Results for author: Bhaskar Ramasubramanian

Found 18 papers, 6 papers with code

ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs

1 code implementation19 Feb 2024 Fengqing Jiang, Zhangchen Xu, Luyao Niu, Zhen Xiang, Bhaskar Ramasubramanian, Bo Li, Radha Poovendran

In this paper, we propose a novel ASCII art-based jailbreak attack and introduce a comprehensive benchmark Vision-in-Text Challenge (ViTC) to evaluate the capabilities of LLMs in recognizing prompts that cannot be solely interpreted by semantics.

Game of Trojans: Adaptive Adversaries Against Output-based Trojaned-Model Detectors

no code implementations12 Feb 2024 Dinuka Sahabandu, Xiaojun Xu, Arezoo Rajabi, Luyao Niu, Bhaskar Ramasubramanian, Bo Li, Radha Poovendran

We propose and analyze an adaptive adversary that can retrain a Trojaned DNN and is also aware of SOTA output-based Trojaned model detectors.

BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models

1 code implementation20 Jan 2024 Zhen Xiang, Fengqing Jiang, Zidi Xiong, Bhaskar Ramasubramanian, Radha Poovendran, Bo Li

Moreover, we show that LLMs endowed with stronger reasoning capabilities exhibit higher susceptibility to BadChain, exemplified by a high average attack success rate of 97. 0% across the six benchmark tasks on GPT-4.

Backdoor Attack

MDTD: A Multi Domain Trojan Detector for Deep Neural Networks

1 code implementation30 Aug 2023 Arezoo Rajabi, Surudhi Asokraj, Fengqing Jiang, Luyao Niu, Bhaskar Ramasubramanian, Jim Ritcey, Radha Poovendran

An adversary carrying out a backdoor attack embeds a predefined perturbation called a trigger into a small subset of input samples and trains the DNN such that the presence of the trigger in the input results in an adversary-desired output class.

Backdoor Attack

Risk-Aware Distributed Multi-Agent Reinforcement Learning

no code implementations4 Apr 2023 Abdullah Al Maruf, Luyao Niu, Bhaskar Ramasubramanian, Andrew Clark, Radha Poovendran

We then propose a distributed MARL algorithm called the CVaR QD-Learning algorithm, and establish that value functions of individual agents reaches consensus.

Decision Making Multi-agent Reinforcement Learning +1

LDL: A Defense for Label-Based Membership Inference Attacks

no code implementations3 Dec 2022 Arezoo Rajabi, Dinuka Sahabandu, Luyao Niu, Bhaskar Ramasubramanian, Radha Poovendran

Overfitted models have been shown to be susceptible to query-based attacks such as membership inference attacks (MIAs).

Game of Trojans: A Submodular Byzantine Approach

no code implementations13 Jul 2022 Dinuka Sahabandu, Arezoo Rajabi, Luyao Niu, Bo Li, Bhaskar Ramasubramanian, Radha Poovendran

The results show that (i) with Submodular Trojan algorithm, the adversary needs to embed a Trojan trigger into a very small fraction of samples to achieve high accuracy on both Trojan and clean samples, and (ii) the MM Trojan algorithm yields a trained Trojan model that evades detection with probability 1.

Privacy-Preserving Reinforcement Learning Beyond Expectation

no code implementations18 Mar 2022 Arezoo Rajabi, Bhaskar Ramasubramanian, Abdullah Al Maruf, Radha Poovendran

Through empirical evaluations, we highlight a privacy-utility tradeoff and demonstrate that the RL agent is able to learn behaviors that are aligned with that of a human user in the same environment in a privacy-preserving manner

Decision Making Privacy Preserving +2

Shaping Advice in Deep Reinforcement Learning

1 code implementation19 Feb 2022 Baicen Xiao, Bhaskar Ramasubramanian, Radha Poovendran

We design two algorithms- Shaping Advice in Single-agent reinforcement learning (SAS) and Shaping Advice in Multi-agent reinforcement learning (SAM).

Multi-agent Reinforcement Learning reinforcement-learning +1

Agent-Temporal Attention for Reward Redistribution in Episodic Multi-Agent Reinforcement Learning

1 code implementation12 Jan 2022 Baicen Xiao, Bhaskar Ramasubramanian, Radha Poovendran

In this paper, we introduce Agent-Temporal Attention for Reward Redistribution in Episodic Multi-Agent Reinforcement Learning (AREL) to address these two challenges.

Multi-agent Reinforcement Learning reinforcement-learning +2

Resilience to Denial-of-Service and Integrity Attacks: A Structured Systems Approach

no code implementations2 Sep 2021 Bhaskar Ramasubramanian, M. A. Rajan, M. Girish Chandra, Rance Cleaveland, Steven I. Marcus

The resilience of cyberphysical systems to denial-of-service (DoS) and integrity attacks is studied in this paper.

Reinforcement Learning Beyond Expectation

no code implementations29 Mar 2021 Bhaskar Ramasubramanian, Luyao Niu, Andrew Clark, Radha Poovendran

In this paper, we consider a setting where an autonomous agent has to learn behaviors in an unknown environment.

reinforcement-learning Reinforcement Learning (RL)

Shaping Advice in Deep Multi-Agent Reinforcement Learning

1 code implementation29 Mar 2021 Baicen Xiao, Bhaskar Ramasubramanian, Radha Poovendran

We observe that using SAM results in agents learning policies to complete tasks faster, and obtain higher rewards than: i) using sparse rewards alone; ii) a state-of-the-art reward redistribution method.

Multi-agent Reinforcement Learning reinforcement-learning +1

Safety-Critical Online Control with Adversarial Disturbances

no code implementations20 Sep 2020 Bhaskar Ramasubramanian, Baicen Xiao, Linda Bushnell, Radha Poovendran

We propose an iterative approach to the synthesis of the controller by solving a modified discrete-time Riccati equation.

Potential-Based Advice for Stochastic Policy Learning

no code implementations20 Jul 2019 Baicen Xiao, Bhaskar Ramasubramanian, Andrew Clark, Hannaneh Hajishirzi, Linda Bushnell, Radha Poovendran

This paper augments the reward received by a reinforcement learning agent with potential functions in order to help the agent learn (possibly stochastic) optimal policies.

Q-Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.