1 code implementation • NeurIPS 2023 • Jacob Portes, Alex Trott, Sam Havens, Daniel King, Abhinav Venigalla, Moin Nadeem, Nikhil Sardana, Daya Khudia, Jonathan Frankle
Here, we introduce MosaicBERT, a BERT-style encoder architecture and training recipe that is empirically optimized for fast pretraining.
no code implementations • 22 Nov 2023 • Aditi Jha, Sam Havens, Jeremey Dohmann, Alex Trott, Jacob Portes
We find that subsets of 1k-6k instruction finetuning samples are sufficient to achieve good performance on both (1) traditional NLP benchmarks and (2) model-based evaluation.
no code implementations • 25 Sep 2019 • Wenling Shang, Alex Trott, Stephan Zheng, Caiming Xiong, Richard Socher
Efficiently learning to solve tasks in complex environments is a key challenge for reinforcement learning (RL) agents.
no code implementations • 1 Jul 2019 • Wenling Shang, Alex Trott, Stephan Zheng, Caiming Xiong, Richard Socher
We perform a thorough ablation study to evaluate our approach on a suite of challenging maze tasks, demonstrating significant advantages from the proposed framework over baselines that lack world graph knowledge in terms of performance and efficiency.
Hierarchical Reinforcement Learning reinforcement-learning +2