Search Results for author: Surya P. N. Singh

Found 3 papers, 2 papers with code

Soft Actor-Critic with Cross-Entropy Policy Optimization

1 code implementation21 Dec 2021 Zhenyang Shi, Surya P. N. Singh

For the purpose of reducing the computational complexity, we also introduce a decoupled policy structure that decouples the Gaussian policy into one policy that learns the mean and one other policy that learns the deviation such that only the mean policy is trained by CEM.

Reinforcement Learning (RL)

LiMIIRL: Lightweight Multiple-Intent Inverse Reinforcement Learning

no code implementations3 Jun 2021 Aaron J. Snoswell, Surya P. N. Singh, Nan Ye

Multiple-Intent Inverse Reinforcement Learning (MI-IRL) seeks to find a reward function ensemble to rationalize demonstrations of different but unlabelled intents.

Clustering reinforcement-learning +1

Revisiting Maximum Entropy Inverse Reinforcement Learning: New Perspectives and Algorithms

1 code implementation1 Dec 2020 Aaron J. Snoswell, Surya P. N. Singh, Nan Ye

This improves the previous heuristic derivation of the MaxEnt IRL model (for stochastic MDPs), allows a unified view of MaxEnt IRL and Relative Entropy IRL, and leads to a model-free learning algorithm for the MaxEnt IRL model.

OpenAI Gym reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.