Search Results for author: Keiran Paster

Found 7 papers, 4 papers with code

OpenWebMath: An Open Dataset of High-Quality Mathematical Web Text

2 code implementations10 Oct 2023 Keiran Paster, Marco Dos Santos, Zhangir Azerbayev, Jimmy Ba

We hope that our dataset, openly released on the Hugging Face Hub, will help spur advances in the reasoning abilities of large language models.

Large Language Models Are Human-Level Prompt Engineers

2 code implementations3 Nov 2022 Yongchao Zhou, Andrei Ioan Muresanu, Ziwen Han, Keiran Paster, Silviu Pitis, Harris Chan, Jimmy Ba

By conditioning on natural language instructions, large language models (LLMs) have displayed impressive capabilities as general-purpose computers.

Few-Shot Learning In-Context Learning +3

You Can't Count on Luck: Why Decision Transformers and RvS Fail in Stochastic Environments

no code implementations31 May 2022 Keiran Paster, Sheila Mcilraith, Jimmy Ba

In all tested domains, ESPER achieves significantly better alignment between the target return and achieved return than simply conditioning on returns.

Offline RL Playing the Game of 2048

Planning from Pixels using Inverse Dynamics Models

no code implementations ICLR 2021 Keiran Paster, Sheila A. McIlraith, Jimmy Ba

Learning task-agnostic dynamics models in high-dimensional observation spaces can be challenging for model-based RL agents.

Cannot find the paper you are looking for? You can Submit a new open access paper.