Search Results for author: Matt Peng

Found 2 papers, 0 papers with code

An Adaptive State Aggregation Algorithm for Markov Decision Processes

no code implementations23 Jul 2021 Guanting Chen, Johann Demetrio Gaebler, Matt Peng, Chunlin Sun, Yinyu Ye

Value iteration is a well-known method of solving Markov Decision Processes (MDPs) that is simple to implement and boasts strong theoretical convergence guarantees.

Linear Representation Meta-Reinforcement Learning for Instant Adaptation

no code implementations12 Jan 2021 Matt Peng, Banghua Zhu, Jiantao Jiao

This paper introduces Fast Linearized Adaptive Policy (FLAP), a new meta-reinforcement learning (meta-RL) method that is able to extrapolate well to out-of-distribution tasks without the need to reuse data from training, and adapt almost instantaneously with the need of only a few samples during testing.

Continuous Control Meta Reinforcement Learning +2

Cannot find the paper you are looking for? You can Submit a new open access paper.