1 code implementation • 19 Feb 2022 • Baicen Xiao, Bhaskar Ramasubramanian, Radha Poovendran
We design two algorithms- Shaping Advice in Single-agent reinforcement learning (SAS) and Shaping Advice in Multi-agent reinforcement learning (SAM).
Multi-agent Reinforcement Learning reinforcement-learning +1
1 code implementation • 12 Jan 2022 • Baicen Xiao, Bhaskar Ramasubramanian, Radha Poovendran
In this paper, we introduce Agent-Temporal Attention for Reward Redistribution in Episodic Multi-Agent Reinforcement Learning (AREL) to address these two challenges.
Multi-agent Reinforcement Learning reinforcement-learning +2
1 code implementation • 29 Mar 2021 • Baicen Xiao, Bhaskar Ramasubramanian, Radha Poovendran
We observe that using SAM results in agents learning policies to complete tasks faster, and obtain higher rewards than: i) using sparse rewards alone; ii) a state-of-the-art reward redistribution method.
Multi-agent Reinforcement Learning reinforcement-learning +1
no code implementations • 20 Sep 2020 • Bhaskar Ramasubramanian, Baicen Xiao, Linda Bushnell, Radha Poovendran
We propose an iterative approach to the synthesis of the controller by solving a modified discrete-time Riccati equation.
no code implementations • 19 Jan 2020 • Baicen Xiao, Qifan Lu, Bhaskar Ramasubramanian, Andrew Clark, Linda Bushnell, Radha Poovendran
The output of the feedback neural network is converted to a shaping reward that is augmented to the reward provided by the environment.
no code implementations • 20 Jul 2019 • Baicen Xiao, Bhaskar Ramasubramanian, Andrew Clark, Hannaneh Hajishirzi, Linda Bushnell, Radha Poovendran
This paper augments the reward received by a reinforcement learning agent with potential functions in order to help the agent learn (possibly stochastic) optimal policies.
no code implementations • 21 Mar 2018 • Hossein Hosseini, Baicen Xiao, Mayoore Jaiswal, Radha Poovendran
In order to conduct large scale experiments, we propose using the model accuracy on images with reversed brightness as a metric to evaluate the shape bias property.
no code implementations • 14 Aug 2017 • Hossein Hosseini, Baicen Xiao, Andrew Clark, Radha Poovendran
At the end, we propose introducing randomness to video analysis algorithms as a countermeasure to our attacks.
no code implementations • 16 Apr 2017 • Hossein Hosseini, Baicen Xiao, Radha Poovendran
For example, an adversary can bypass an image filtering system by adding noise to inappropriate images.
no code implementations • 26 Mar 2017 • Hossein Hosseini, Baicen Xiao, Radha Poovendran
For this, we select an image, which is different from the video content, and insert it, periodically and at a very low rate, into the video.
no code implementations • 20 Mar 2017 • Hossein Hosseini, Baicen Xiao, Mayoore Jaiswal, Radha Poovendran
To this end, we evaluate CNNs on negative images, since they share the same structure and semantics as regular images and humans can classify them correctly.