Search Results for author: Baicen Xiao

Found 11 papers, 3 papers with code

Shaping Advice in Deep Reinforcement Learning

1 code implementation • 19 Feb 2022 • Baicen Xiao, Bhaskar Ramasubramanian, Radha Poovendran

We design two algorithms- Shaping Advice in Single-agent reinforcement learning (SAS) and Shaping Advice in Multi-agent reinforcement learning (SAM).

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Code

Agent-Temporal Attention for Reward Redistribution in Episodic Multi-Agent Reinforcement Learning

1 code implementation • 12 Jan 2022 • Baicen Xiao, Bhaskar Ramasubramanian, Radha Poovendran

In this paper, we introduce Agent-Temporal Attention for Reward Redistribution in Episodic Multi-Agent Reinforcement Learning (AREL) to address these two challenges.

Multi-agent Reinforcement Learning reinforcement-learning +2

Paper
Code

Shaping Advice in Deep Multi-Agent Reinforcement Learning

1 code implementation • 29 Mar 2021 • Baicen Xiao, Bhaskar Ramasubramanian, Radha Poovendran

We observe that using SAM results in agents learning policies to complete tasks faster, and obtain higher rewards than: i) using sparse rewards alone; ii) a state-of-the-art reward redistribution method.

Multi-agent Reinforcement Learning reinforcement-learning +1

Paper
Code

Safety-Critical Online Control with Adversarial Disturbances

no code implementations • 20 Sep 2020 • Bhaskar Ramasubramanian, Baicen Xiao, Linda Bushnell, Radha Poovendran

We propose an iterative approach to the synthesis of the controller by solving a modified discrete-time Riccati equation.

Paper
Add Code

FRESH: Interactive Reward Shaping in High-Dimensional State Spaces using Human Feedback

no code implementations • 19 Jan 2020 • Baicen Xiao, Qifan Lu, Bhaskar Ramasubramanian, Andrew Clark, Linda Bushnell, Radha Poovendran

The output of the feedback neural network is converted to a shaping reward that is augmented to the reward provided by the environment.

Atari Games reinforcement-learning +1

Paper
Add Code

Potential-Based Advice for Stochastic Policy Learning

no code implementations • 20 Jul 2019 • Baicen Xiao, Bhaskar Ramasubramanian, Andrew Clark, Hannaneh Hajishirzi, Linda Bushnell, Radha Poovendran

This paper augments the reward received by a reinforcement learning agent with potential functions in order to help the agent learn (possibly stochastic) optimal policies.

Q-Learning

Paper
Add Code

Assessing Shape Bias Property of Convolutional Neural Networks

no code implementations • 21 Mar 2018 • Hossein Hosseini, Baicen Xiao, Mayoore Jaiswal, Radha Poovendran

In order to conduct large scale experiments, we propose using the model accuracy on images with reversed brightness as a metric to evaluate the shape bias property.

One-Shot Learning

Paper
Add Code

Attacking Automatic Video Analysis Algorithms: A Case Study of Google Cloud Video Intelligence API

no code implementations • 14 Aug 2017 • Hossein Hosseini, Baicen Xiao, Andrew Clark, Radha Poovendran

At the end, we propose introducing randomness to video analysis algorithms as a countermeasure to our attacks.

Video Classification

Paper
Add Code

Google's Cloud Vision API Is Not Robust To Noise

no code implementations • 16 Apr 2017 • Hossein Hosseini, Baicen Xiao, Radha Poovendran

For example, an adversary can bypass an image filtering system by adding noise to inappropriate images.

Paper
Add Code

Deceiving Google's Cloud Video Intelligence API Built for Summarizing Videos

no code implementations • 26 Mar 2017 • Hossein Hosseini, Baicen Xiao, Radha Poovendran

For this, we select an image, which is different from the video content, and insert it, periodically and at a very low rate, into the video.

Image Classification

Paper
Add Code

On the Limitation of Convolutional Neural Networks in Recognizing Negative Images

no code implementations • 20 Mar 2017 • Hossein Hosseini, Baicen Xiao, Mayoore Jaiswal, Radha Poovendran

To this end, we evaluate CNNs on negative images, since they share the same structure and semantics as regular images and humans can classify them correctly.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.