14 Dec 2023 • Rhys Gould, Euan Ong, George Ogden, Arthur Conmy
In this work we present successor heads: attention heads that increment tokens with a natural ordering, such as numbers, months, and days.
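
As a rough illustration of what "incrementing tokens with a natural ordering" means at the input/output level, the toy lookup below maps a token to the next item in its sequence. It is only a sketch and does not model an attention head or the authors' method. The name `successor`, the two word lists, and the choice to return `None` at the end of a sequence (rather than wrapping around) are assumptions made for illustration.

```python
from typing import Optional

# Hypothetical ordered token classes, chosen for illustration only.
MONTHS = ["January", "February", "March", "April", "May", "June", "July",
          "August", "September", "October", "November", "December"]
DAYS = ["Monday", "Tuesday", "Wednesday", "Thursday", "Friday",
        "Saturday", "Sunday"]


def successor(token: str) -> Optional[str]:
    """Return the next token in a natural ordering, or None if there is none."""
    # Numbers: increment the integer value.
    if token.isascii() and token.isdigit():
        return str(int(token) + 1)
    # Months and days: look up the following element in the list.
    for sequence in (MONTHS, DAYS):
        if token in sequence:
            index = sequence.index(token)
            # Assumption: no wrap-around at the end of a sequence.
            return sequence[index + 1] if index + 1 < len(sequence) else None
    # Tokens without a known ordering have no successor in this toy.
    return None
```

For example, `successor("Monday")` gives `"Tuesday"` and `successor("9")` gives `"10"`, while a token outside these classes gives `None`.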
Language Modelling