Search Results for author: Ethan Hall

Found 2 papers, 0 papers with code

Understanding Transformer Reasoning Capabilities via Graph Algorithms

no code implementations28 May 2024 Clayton Sanford, Bahare Fatemi, Ethan Hall, Anton Tsitsulin, Mehran Kazemi, Jonathan Halcrow, Bryan Perozzi, Vahab Mirrokni

Our novel representational hierarchy separates 9 algorithmic reasoning problems into classes solvable by transformers in different realistic parameter scaling regimes.

Retrieval

RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

no code implementations1 Sep 2023 Harrison Lee, Samrat Phatale, Hassan Mansoor, Thomas Mesnard, Johan Ferret, Kellie Lu, Colton Bishop, Ethan Hall, Victor Carbune, Abhinav Rastogi, Sushant Prakash

Reinforcement learning from human feedback (RLHF) has proven effective in aligning large language models (LLMs) with human preferences, but gathering high-quality preference labels is expensive.

Dialogue Generation reinforcement-learning

Cannot find the paper you are looking for? You can Submit a new open access paper.