Search Results for author: Paria Rashidinejad

Found 3 papers, 2 papers with code

MADE: Exploration via Maximizing Deviation from Explored Regions

1 code implementation18 Jun 2021 Tianjun Zhang, Paria Rashidinejad, Jiantao Jiao, Yuandong Tian, Joseph Gonzalez, Stuart Russell

As a proof of concept, we evaluate the new intrinsic reward on tabular examples across a variety of model-based and model-free algorithms, showing improvements over count-only exploration strategies.

Efficient Exploration

Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism

no code implementations22 Mar 2021 Paria Rashidinejad, Banghua Zhu, Cong Ma, Jiantao Jiao, Stuart Russell

Based on the composition of the offline dataset, two main categories of methods are used: imitation learning which is suitable for expert datasets and vanilla offline RL which often requires uniform coverage datasets.

Imitation Learning Multi-Armed Bandits +1

SLIP: Learning to Predict in Unknown Dynamical Systems with Long-Term Memory

1 code implementation NeurIPS 2020 Paria Rashidinejad, Jiantao Jiao, Stuart Russell

Our theoretical and experimental results shed light on the conditions required for efficient probably approximately correct (PAC) learning of the Kalman filter from partially observed data.

Cannot find the paper you are looking for? You can Submit a new open access paper.