Search Results for author: Ajin Joseph

Found 2 papers, 1 paper with code

Two-Timescale Networks for Nonlinear Value Function Approximation

no code implementations ICLR 2019 Wesley Chung, Somjit Nath, Ajin Joseph, Martha White

A key component for many reinforcement learning agents is to learn a value function, either for policy evaluation or control.

Q-Learning, Vocal Bursts Valence Prediction

Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement

1 code implementation 22 Oct 2018 Samuel Neumann, Sungsu Lim, Ajin Joseph, Yangchen Pan, Adam White, Martha White

We first provide a policy improvement result in an idealized setting, and then prove that our conditional CEM (CCEM) strategy tracks a CEM update per state, even with changing action-values.

Policy Gradient Methods, Q-Learning
