Search Results for author: Swapnil Pande

Found 1 papers, 0 papers with code

Addressing Optimism Bias in Sequence Modeling for Reinforcement Learning

no code implementations21 Jul 2022 Adam Villaflor, Zhe Huang, Swapnil Pande, John Dolan, Jeff Schneider

Impressive results in natural language processing (NLP) based on the Transformer neural network architecture have inspired researchers to explore viewing offline reinforcement learning (RL) as a generic sequence modeling problem.

Autonomous Driving D4RL +2

Cannot find the paper you are looking for? You can Submit a new open access paper.