Search Results for author: Kristijan Armeni

Found 3 papers, 2 papers with code

Transformer verbatim in-context retrieval across time and scale

1 code implementation • 11 Nov 2024 • Kristijan Armeni, Marko Pranjić, Senja Pollak

We further found that the development of verbatim in-context retrieval is positively correlated with the learning of zero-shot benchmarks.


Characterizing Verbatim Short-Term Memory in Neural Language Models

1 code implementation • 24 Oct 2022 • Kristijan Armeni, Christopher Honey, Tal Linzen

We tested whether language models could retrieve the exact words that occurred previously in a text.

Language Modelling, Retrieval

Short-term memory in neural language models

no code implementations • 29 Sep 2021 • Kristijan Armeni, Christopher Honey, Tal Linzen

Thus, although the transformer and LSTM architectures were both trained to predict language sequences, only the transformer learned to flexibly index prior tokens.

Language Modelling, Retrieval
