Search Results for author: Bhaskar Ramasubramanian

Found 18 papers, 6 papers with code

ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs

1 code implementation • 19 Feb 2024 • Fengqing Jiang, Zhangchen Xu, Luyao Niu, Zhen Xiang, Bhaskar Ramasubramanian, Bo Li, Radha Poovendran

In this paper, we propose a novel ASCII art-based jailbreak attack and introduce a comprehensive benchmark Vision-in-Text Challenge (ViTC) to evaluate the capabilities of LLMs in recognizing prompts that cannot be solely interpreted by semantics.

Paper
Code

Game of Trojans: Adaptive Adversaries Against Output-based Trojaned-Model Detectors

no code implementations • 12 Feb 2024 • Dinuka Sahabandu, Xiaojun Xu, Arezoo Rajabi, Luyao Niu, Bhaskar Ramasubramanian, Bo Li, Radha Poovendran

We propose and analyze an adaptive adversary that can retrain a Trojaned DNN and is also aware of SOTA output-based Trojaned model detectors.

Paper
Add Code

Double-Dip: Thwarting Label-Only Membership Inference Attacks with Transfer Learning and Randomization

no code implementations • 2 Feb 2024 • Arezoo Rajabi, Reeya Pimple, Aiswarya Janardhanan, Surudhi Asokraj, Bhaskar Ramasubramanian, Radha Poovendran

However, the suitability of TL as a solution to reduce vulnerability of overfitted DNNs to privacy attacks is unexplored.

Transfer Learning

Paper
Add Code

BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models

1 code implementation • 20 Jan 2024 • Zhen Xiang, Fengqing Jiang, Zidi Xiong, Bhaskar Ramasubramanian, Radha Poovendran, Bo Li

Moreover, we show that LLMs endowed with stronger reasoning capabilities exhibit higher susceptibility to BadChain, exemplified by a high average attack success rate of 97. 0% across the six benchmark tasks on GPT-4.

Backdoor Attack

Paper
Code

MDTD: A Multi Domain Trojan Detector for Deep Neural Networks

1 code implementation • 30 Aug 2023 • Arezoo Rajabi, Surudhi Asokraj, Fengqing Jiang, Luyao Niu, Bhaskar Ramasubramanian, Jim Ritcey, Radha Poovendran

An adversary carrying out a backdoor attack embeds a predefined perturbation called a trigger into a small subset of input samples and trains the DNN such that the presence of the trigger in the input results in an adversary-desired output class.

Backdoor Attack

Paper
Code

Risk-Aware Distributed Multi-Agent Reinforcement Learning

no code implementations • 4 Apr 2023 • Abdullah Al Maruf, Luyao Niu, Bhaskar Ramasubramanian, Andrew Clark, Radha Poovendran

We then propose a distributed MARL algorithm called the CVaR QD-Learning algorithm, and establish that value functions of individual agents reaches consensus.

Decision Making Multi-agent Reinforcement Learning +1

Paper
Add Code

LDL: A Defense for Label-Based Membership Inference Attacks

no code implementations • 3 Dec 2022 • Arezoo Rajabi, Dinuka Sahabandu, Luyao Niu, Bhaskar Ramasubramanian, Radha Poovendran

Overfitted models have been shown to be susceptible to query-based attacks such as membership inference attacks (MIAs).

Paper
Add Code

Game of Trojans: A Submodular Byzantine Approach

no code implementations • 13 Jul 2022 • Dinuka Sahabandu, Arezoo Rajabi, Luyao Niu, Bo Li, Bhaskar Ramasubramanian, Radha Poovendran

The results show that (i) with Submodular Trojan algorithm, the adversary needs to embed a Trojan trigger into a very small fraction of samples to achieve high accuracy on both Trojan and clean samples, and (ii) the MM Trojan algorithm yields a trained Trojan model that evades detection with probability 1.

Paper
Add Code

Trojan Horse Training for Breaking Defenses against Backdoor Attacks in Deep Learning

no code implementations • 25 Mar 2022 • Arezoo Rajabi, Bhaskar Ramasubramanian, Radha Poovendran

We term this category of attacks multi-target backdoor attacks.

Backdoor Attack

Paper
Add Code

Privacy-Preserving Reinforcement Learning Beyond Expectation

no code implementations • 18 Mar 2022 • Arezoo Rajabi, Bhaskar Ramasubramanian, Abdullah Al Maruf, Radha Poovendran

Through empirical evaluations, we highlight a privacy-utility tradeoff and demonstrate that the RL agent is able to learn behaviors that are aligned with that of a human user in the same environment in a privacy-preserving manner

Decision Making Privacy Preserving +2

Paper
Add Code

Shaping Advice in Deep Reinforcement Learning

1 code implementation • 19 Feb 2022 • Baicen Xiao, Bhaskar Ramasubramanian, Radha Poovendran

We design two algorithms- Shaping Advice in Single-agent reinforcement learning (SAS) and Shaping Advice in Multi-agent reinforcement learning (SAM).

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Code

Agent-Temporal Attention for Reward Redistribution in Episodic Multi-Agent Reinforcement Learning

1 code implementation • 12 Jan 2022 • Baicen Xiao, Bhaskar Ramasubramanian, Radha Poovendran

In this paper, we introduce Agent-Temporal Attention for Reward Redistribution in Episodic Multi-Agent Reinforcement Learning (AREL) to address these two challenges.

Multi-agent Reinforcement Learning reinforcement-learning +2

Paper
Code

Resilience to Denial-of-Service and Integrity Attacks: A Structured Systems Approach

no code implementations • 2 Sep 2021 • Bhaskar Ramasubramanian, M. A. Rajan, M. Girish Chandra, Rance Cleaveland, Steven I. Marcus

The resilience of cyberphysical systems to denial-of-service (DoS) and integrity attacks is studied in this paper.

Paper
Add Code

Reinforcement Learning Beyond Expectation

no code implementations • 29 Mar 2021 • Bhaskar Ramasubramanian, Luyao Niu, Andrew Clark, Radha Poovendran

In this paper, we consider a setting where an autonomous agent has to learn behaviors in an unknown environment.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

Shaping Advice in Deep Multi-Agent Reinforcement Learning

1 code implementation • 29 Mar 2021 • Baicen Xiao, Bhaskar Ramasubramanian, Radha Poovendran

We observe that using SAM results in agents learning policies to complete tasks faster, and obtain higher rewards than: i) using sparse rewards alone; ii) a state-of-the-art reward redistribution method.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Code

Safety-Critical Online Control with Adversarial Disturbances

no code implementations • 20 Sep 2020 • Bhaskar Ramasubramanian, Baicen Xiao, Linda Bushnell, Radha Poovendran

We propose an iterative approach to the synthesis of the controller by solving a modified discrete-time Riccati equation.

Paper
Add Code

FRESH: Interactive Reward Shaping in High-Dimensional State Spaces using Human Feedback

no code implementations • 19 Jan 2020 • Baicen Xiao, Qifan Lu, Bhaskar Ramasubramanian, Andrew Clark, Linda Bushnell, Radha Poovendran

The output of the feedback neural network is converted to a shaping reward that is augmented to the reward provided by the environment.

Atari Games reinforcement-learning +1

Paper
Add Code

Potential-Based Advice for Stochastic Policy Learning

no code implementations • 20 Jul 2019 • Baicen Xiao, Bhaskar Ramasubramanian, Andrew Clark, Hannaneh Hajishirzi, Linda Bushnell, Radha Poovendran

This paper augments the reward received by a reinforcement learning agent with potential functions in order to help the agent learn (possibly stochastic) optimal policies.

Q-Learning

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.