Search Results for author: Maximilian Schlegel

Found 1 papers, 0 papers with code

Uncovering mesa-optimization algorithms in Transformers

no code implementations11 Sep 2023 Johannes von Oswald, Maximilian Schlegel, Alexander Meulemans, Seijin Kobayashi, Eyvind Niklasson, Nicolas Zucchet, Nino Scherrer, Nolan Miller, Mark Sandler, Blaise Agüera y Arcas, Max Vladymyrov, Razvan Pascanu, João Sacramento

Some autoregressive models exhibit in-context learning capabilities: being able to learn as an input sequence is processed, without undergoing any parameter changes, and without being explicitly trained to do so.

In-Context Learning Language Modelling

Cannot find the paper you are looking for? You can Submit a new open access paper.