Search Results for author: Kristijan Armeni

Found 2 papers, 1 papers with code

Characterizing Verbatim Short-Term Memory in Neural Language Models

1 code implementation • 24 Oct 2022 • Kristijan Armeni, Christopher Honey, Tal Linzen

We tested whether language models could retrieve the exact words that occurred previously in a text.

Paper
Code

Short-term memory in neural language models

no code implementations • 29 Sep 2021 • Kristijan Armeni, Christopher Honey, Tal Linzen

Thus, although the transformer and LSTM architectures were both trained to predict language sequences, only the transformer learned to flexibly index prior tokens.

Language Modelling Retrieval

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.