1 code implementation • 27 May 2024 • Soyuj Basnet, Jerry Gou, Antonio Mallia, Torsten Suel
A lot of recent work has focused on sparse learned indexes that use deep neural architectures to significantly improve retrieval quality while keeping the efficiency benefits of the inverted index.
1 code implementation • 2 May 2024 • Antonio Mallia, Torten Suel, Nicola Tonellotto
Learned sparse retrieval systems aim to combine the effectiveness of contextualized language models with the scalability of conventional data structures such as inverted indexes.
1 code implementation • 12 Jan 2024 • Puxuan Yu, Antonio Mallia, Matthias Petri
We explore leveraging corpus-specific vocabularies that improve both efficiency and effectiveness of learned sparse retrieval systems.
1 code implementation • 24 Apr 2022 • Antonio Mallia, Joel Mackenzie, Torsten Suel, Nicola Tonellotto
Neural information retrieval architectures based on transformers such as BERT are able to significantly improve system effectiveness over traditional sparse models such as BM25.
1 code implementation • 24 Apr 2021 • Antonio Mallia, Omar Khattab, Nicola Tonellotto, Torsten Suel
Neural information retrieval systems typically use a cascading pipeline, in which a first-stage model retrieves a candidate set of documents and one or more subsequent stages re-rank this set using contextualized language models such as BERT.
2 code implementations • 18 Mar 2020 • Jimmy Lin, Joel Mackenzie, Chris Kamphuis, Craig Macdonald, Antonio Mallia, Michał Siedlaczek, Andrew Trotman, Arjen de Vries
There exists a natural tension between encouraging a diverse ecosystem of open-source search engines and supporting fair, replicable comparisons across those systems.
1 code implementation • 8 Aug 2018 • Melanie Tosik, Antonio Mallia, Kedar Gangopadhyay
Identifying the stance of a news article body with respect to a certain headline is the first step to automated fake news detection.