no code implementations • 11 Dec 2023 • Dashiell Stander, Qinan Yu, Honglu Fan, Stella Biderman
We use the group Fourier transform over the symmetric group $S_n$ to reverse engineer a 1-layer feedforward network that has "grokked" the multiplication of $S_5$ and $S_6$.
1 code implementation • 18 Apr 2022 • Katherine Crowson, Stella Biderman, Daniel Kornis, Dashiell Stander, Eric Hallahan, Louis Castricato, Edward Raff
Generating and editing images from open-domain text prompts is a challenging task that has heretofore required expensive and specially trained models.