Search Results for author: Junqi Qian

Found 1 papers, 0 papers with code

Learning Rewards to Optimize Global Performance Metrics in Deep Reinforcement Learning

no code implementations16 Mar 2023 Junqi Qian, Paul Weng, Chenmien Tan

LR4GPM alternates between two phases: (1) learning a (possibly vector) reward function used to fit the performance metric, and (2) training a policy to optimize an approximation of this performance metric based on the learned rewards.

Autonomous Driving reinforcement-learning +1

Cannot find the paper you are looking for? You can Submit a new open access paper.