no code implementations • 4 Feb 2025 • Mateusz Piotrowski, Paul M. Riechers, Daniel Filan, Adam S. Shai
What computational structures emerge in transformers trained on next-token prediction?
Bayesian Inference • Prediction