Search Results for author: Christopher Honey

Found 3 papers, 2 papers with code

Characterizing Verbatim Short-Term Memory in Neural Language Models

1 code implementation24 Oct 2022 Kristijan Armeni, Christopher Honey, Tal Linzen

We tested whether language models could retrieve the exact words that occurred previously in a text.

Language Modelling Retrieval

Short-term memory in neural language models

no code implementations29 Sep 2021 Kristijan Armeni, Christopher Honey, Tal Linzen

Thus, although the transformer and LSTM architectures were both trained to predict language sequences, only the transformer learned to flexibly index prior tokens.

Language Modelling Retrieval

Cannot find the paper you are looking for? You can Submit a new open access paper.