Search Results for author: Brian K Chen

Found 1 papers, 0 papers with code

Exact Conversion of In-Context Learning to Model Weights in Linearized-Attention Transformers

no code implementations5 Jun 2024 Brian K Chen, Tianyang Hu, Hui Jin, Hwee Kuan Lee, Kenji Kawaguchi

We further suggest how our method can be adapted to achieve cheap approximate conversion of ICL tokens, even in regular transformer networks that are not linearized.

In-Context Learning

Cannot find the paper you are looking for? You can Submit a new open access paper.