Search Results for author: Alexander Gietelink Oldenziel

Found 1 paper, 0 papers with code

Transformers represent belief state geometry in their residual stream

no code implementations • 24 May 2024 • Adam S. Shai, Sarah E. Marzen, Lucas Teixeira, Alexander Gietelink Oldenziel, Paul M. Riechers

What computational structure are we building into large language models when we train them on next-token prediction?

Prediction
