Search Results for author: Md Masudur Rahman

Found 10 papers, 5 papers with code

Adversarial Style Transfer for Robust Policy Optimization in Deep Reinforcement Learning

no code implementations • 29 Aug 2023 • Md Masudur Rahman, Yexiang Xue

An additional goal of the generator is to perturb the observation so as to maximize the agent's probability of taking a different action.

Data Augmentation, reinforcement-learning, +1

Adversarial Policy Optimization in Deep Reinforcement Learning

no code implementations • 27 Apr 2023 • Md Masudur Rahman, Yexiang Xue

Data augmentation can provide a performance boost to RL agents by mitigating the effect of overfitting.

Data Augmentation, reinforcement-learning

Accelerating Policy Gradient by Estimating Value Function from Prior Computation in Deep Reinforcement Learning

no code implementations • 2 Feb 2023 • Md Masudur Rahman, Yexiang Xue

Our approach is to estimate the value function from prior computations, such as from the Q-network learned in DQN or the value function trained for different but related environments.

Policy Gradient Methods

Robust Policy Optimization in Deep Reinforcement Learning

1 code implementation • 14 Dec 2022 • Md Masudur Rahman, Yexiang Xue

We observed that in many settings, RPO increases the policy entropy early in training and then maintains a certain level of entropy throughout the training period.

Continuous Control, Data Augmentation, +3

Bootstrap Advantage Estimation for Policy Optimization in Reinforcement Learning

1 code implementation • 13 Oct 2022 • Md Masudur Rahman, Yexiang Xue

Unlike existing methods, which apply data augmentation to the input to learn the value and policy functions, our method uses data augmentation to compute a bootstrap advantage estimate.

Data Augmentation, reinforcement-learning, +1

Adversarial Style Transfer for Robust Policy Optimization in Reinforcement Learning

no code implementations • 29 Sep 2021 • Md Masudur Rahman, Yexiang Xue

An additional goal of the generator is to perturb the observation so as to maximize the agent's probability of taking a different action.

reinforcement-learning, Reinforcement Learning (RL), +1

DESK: A Robotic Activity Dataset for Dexterous Surgical Skills Transfer to Medical Robots

1 code implementation • 3 Mar 2019 • Naveen Madapana, Md Masudur Rahman, Natalia Sanchez-Tamayo, Mythra V. Balakuntala, Glebys Gonzalez, Jyothsna Padmakumar Bindu, L. N. Vishnunandan Venkatesh, Xingguang Zhang, Juan Barragan Noguera, Thomas Low, Richard Voyles, Yexiang Xue, Juan Wachs

It comprises a set of surgical robotic skills collected during a surgical training task using three robotic platforms: the Taurus II robot, the simulated Taurus II robot, and the YuMi robot.

Robotics

A Case Study on the Impact of Similarity Measure on Information Retrieval based Software Engineering Tasks

no code implementations • 8 Aug 2018 • Md Masudur Rahman, Saikat Chakraborty, Gail Kaiser, Baishakhi Ray

In particular, we analyze two previously proposed tools, one for project recommendation and one for bug localization, both of which leverage diverse software artifacts, and observe that an informed choice of similarity measure does improve the performance of these existing SE tools.

Information Retrieval, Retrieval
