Search Results for author: Md Masudur Rahman

Our approach is to estimate the value function from prior computations, such as from the Q-network learned in DQN or the value function trained for different but related environments.

Policy Gradient Methods

Paper
Add Code

Robust Policy Optimization in Deep Reinforcement Learning

1 code implementation • 14 Dec 2022 • Md Masudur Rahman, Yexiang Xue

We observed that in many settings, RPO increases the policy entropy early in training and then maintains a certain level of entropy throughout the training period.

Continuous Control Data Augmentation +3

4,453

Paper
Code

Bootstrap Advantage Estimation for Policy Optimization in Reinforcement Learning

1 code implementation • 13 Oct 2022 • Md Masudur Rahman, Yexiang Xue

Unlike using data augmentation on the input to learn value and policy function as existing methods use, our method uses data augmentation to compute a bootstrap advantage estimation.

Data Augmentation reinforcement-learning +1

Paper
Code

Bootstrap State Representation using Style Transfer for Better Generalization in Deep Reinforcement Learning

1 code implementation • 15 Jul 2022 • Md Masudur Rahman, Yexiang Xue

Deep Reinforcement Learning (RL) agents often overfit the training environment, leading to poor generalization performance.

Data Augmentation Reinforcement Learning (RL) +1

Paper
Code

Adversarial Style Transfer for Robust Policy Optimization in Reinforcement Learning

no code implementations • 29 Sep 2021 • Md Masudur Rahman, Yexiang Xue

An additional goal of the generator is to perturb the observation, which maximizes the agent's probability of taking a different action.

reinforcement-learning Reinforcement Learning (RL) +1

Paper
Add Code

DESK: A Robotic Activity Dataset for Dexterous Surgical Skills Transfer to Medical Robots

1 code implementation • 3 Mar 2019 • Naveen Madapana, Md Masudur Rahman, Natalia Sanchez-Tamayo, Mythra V. Balakuntala, Glebys Gonzalez, Jyothsna Padmakumar Bindu, L. N. Vishnunandan Venkatesh, Xingguang Zhang, Juan Barragan Noguera, Thomas Low, Richard Voyles, Yexiang Xue, Juan Wachs

It comprises a set of surgical robotic skills collected during a surgical training task using three robotic platforms: the Taurus II robot, Taurus II simulated robot, and the YuMi robot.

Robotics

Paper
Code

A Case Study on the Impact of Similarity Measure on Information Retrieval based Software Engineering Tasks

no code implementations • 8 Aug 2018 • Md Masudur Rahman, Saikat Chakraborty, Gail Kaiser, Baishakhi Ray

In particular, we analyze two previously proposed tools for project recommendation and bug localization tasks, which leverage diverse software artifacts, and observe that an informed choice of similarity measure indeed leads to improved performance of the existing SE tools.

Information Retrieval Retrieval

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.