Search Results for author: Pulkit Katdare

Found 3 papers, 1 papers with code

Towards Provable Log Density Policy Gradient

no code implementations • 3 Mar 2024 • Pulkit Katdare, Anant Joshi, Katherine Driggs-Campbell

In this work, we argue that this residual term is significant and correcting for it could potentially improve sample-complexity of reinforcement learning methods.

Policy Gradient Methods reinforcement-learning

Paper
Add Code

Marginalized Importance Sampling for Off-Environment Policy Evaluation

no code implementations • 4 Sep 2023 • Pulkit Katdare, Nan Jiang, Katherine Driggs-Campbell

This paper proposes a new approach to evaluate the real-world performance of agent policies prior to deploying them in the real world.

Reinforcement Learning (RL)

Paper
Add Code

Off Environment Evaluation Using Convex Risk Minimization

1 code implementation • 21 Dec 2021 • Pulkit Katdare, Shuijing Liu, Katherine Driggs-Campbell

We also show that the our method is able to estimate performance of a 7 DOF robotic arm using the simulator and remotely collected data from the robot in the real world.

Reinforcement Learning (RL)

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.