Search Results for author: Fengdi Che

Found 3 papers, 0 papers with code

Correcting discount-factor mismatch in on-policy policy gradient methods

no code implementations • 23 Jun 2023 • Fengdi Che, Gautham Vasan, A. Rupam Mahmood

The policy gradient theorem gives a convenient form of the policy gradient in terms of three factors: an action value, a gradient of the action likelihood, and a state distribution involving discounting called the \emph{discounted stationary distribution}.

OpenAI Gym Policy Gradient Methods

Paper
Add Code

Bayesian Q-learning With Imperfect Expert Demonstrations

no code implementations • 1 Oct 2022 • Fengdi Che, Xiru Zhu, Doina Precup, David Meger, Gregory Dudek

Guided exploration with expert demonstrations improves data efficiency for reinforcement learning, but current algorithms often overuse expert information.

Atari Games Q-Learning +2

Paper
Add Code

Detecting GAN generated errors

no code implementations • 2 Dec 2019 • Xiru Zhu, Fengdi Che, Tianzi Yang, Tzuyang Yu, David Meger, Gregory Dudek

This is because the task of evaluating the quality of a generated image differs from deciding if an image is real or fake.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.