Different from previous work that applies joint random masking to both modalities, we use conditional masking on pre-training tasks (i.e., masked language/region modeling is conditioned on full observation of image/text).
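As a rough illustration of the distinction, the sketch below contrasts joint random masking with conditional masking, where only one modality is masked per step while the other stays fully observed. The mask probability, vocabulary size, and feature shapes are illustrative assumptions, not the paper's actual configuration.

```python
import torch

def joint_random_mask(text_ids, region_feats, p=0.15):
    """Mask both modalities at once: a masked word may co-occur with a masked region."""
    text_mask = torch.rand(text_ids.shape) < p
    region_mask = torch.rand(region_feats.shape[:2]) < p
    return text_mask, region_mask

def conditional_mask(text_ids, region_feats, p=0.15, mask_text=True):
    """Mask only one modality per step, keeping the other fully observed."""
    if mask_text:
        text_mask = torch.rand(text_ids.shape) < p                            # masked language modeling
        region_mask = torch.zeros(region_feats.shape[:2], dtype=torch.bool)   # full image observed
    else:
        text_mask = torch.zeros(text_ids.shape, dtype=torch.bool)             # full text observed
        region_mask = torch.rand(region_feats.shape[:2]) < p                  # masked region modeling
    return text_mask, region_mask

# Toy batch: 2 captions of 8 tokens, 2 images with 36 region features of dim 2048.
text_ids = torch.randint(0, 30522, (2, 8))
region_feats = torch.randn(2, 36, 2048)
t_mask, r_mask = conditional_mask(text_ids, region_feats, mask_text=True)
print(t_mask.sum().item(), "text tokens masked;", r_mask.sum().item(), "regions masked")
```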
We introduce an approach for open-domain question answering (QA) that retrieves and reads a passage graph, where vertices are passages of text and edges represent relationships that are derived from an external knowledge base or co-occurrence in the same article.
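A toy version of such a passage graph, with hypothetical passages and edge labels, might look like the following; the relation names and the adjacency helper are purely illustrative.

```python
# Minimal passage-graph sketch: vertices are passages, edges carry the relation
# that links them (an external knowledge base or same-article co-occurrence).
passages = {
    "p1": "Marie Curie was born in Warsaw in 1867.",
    "p2": "Warsaw is the capital and largest city of Poland.",
    "p3": "Curie shared the 1903 Nobel Prize in Physics with Pierre Curie.",
}

edges = [
    ("p1", "p2", "kb:located_in"),   # linked via an external knowledge base
    ("p1", "p3", "same_article"),    # co-occur in the same article
]

def neighbors(passage_id):
    """Return (neighbor, relation) pairs reachable from a passage vertex."""
    out = []
    for a, b, rel in edges:
        if a == passage_id:
            out.append((b, rel))
        elif b == passage_id:
            out.append((a, rel))
    return out

# A retrieve-and-read system would start from seed passages and expand along edges.
print(neighbors("p1"))  # [('p2', 'kb:located_in'), ('p3', 'same_article')]
```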
Prior work either simply aggregates the similarity of all possible pairs of regions and words without attending differentially to more and less important words or regions, or uses a multi-step attentional process to capture a limited number of semantic alignments, which is less interpretable.
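For contrast, a minimal sketch of attending differentially over region-word pairs is given below: cosine similarities between every word and region are turned into attention weights so that each word draws context from its most relevant regions. The embedding size, number of regions, and temperature are assumptions for illustration.

```python
import torch
import torch.nn.functional as F

def word_to_region_attention(words, regions, temperature=4.0):
    """
    words:   (n_words, d)   word embeddings for one caption
    regions: (n_regions, d) region features for one image
    Returns similarities, attention weights, and a region context vector per word.
    """
    words_n = F.normalize(words, dim=-1)
    regions_n = F.normalize(regions, dim=-1)
    sim = words_n @ regions_n.t()                  # (n_words, n_regions) cosine similarities
    attn = F.softmax(temperature * sim, dim=-1)    # each word attends over all regions
    context = attn @ regions                       # (n_words, d) weighted region context
    return sim, attn, context

words = torch.randn(6, 256)      # e.g. 6 words in a 256-d joint space
regions = torch.randn(36, 256)   # e.g. 36 detected regions
sim, attn, context = word_to_region_attention(words, regions)

# Naive aggregation treats all pairs equally; attention up-weights the relevant regions.
naive_score = sim.mean()
attended_score = (attn * sim).sum(dim=-1).mean()
print(float(naive_score), float(attended_score))
```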
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Furthermore, performance improvement has been largely achieved by scaling up the dataset with noisy image-text pairs collected from the web, which is a suboptimal source of supervision.
We evaluate our method on the task of video retrieval and report results for the MPII Movie Description and MSR-VTT datasets.
In this paper, we identify that the main bottleneck is in the training mechanisms, where the negative instances used in training are not representative of the irrelevant documents in testing.
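A common way to act on this observation, sketched here with assumed shapes and randomly generated placeholder embeddings, is to mine hard negatives from the corpus (the highest-scoring non-gold documents) so that training negatives better resemble the irrelevant documents encountered at test time.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
d, num_docs = 128, 1000

# Placeholders standing in for an encoded corpus and a query batch.
doc_emb = F.normalize(torch.randn(num_docs, d), dim=-1)
query_emb = F.normalize(torch.randn(4, d), dim=-1)
positive_ids = torch.tensor([3, 17, 256, 999])   # gold passage per query (illustrative)

def mine_hard_negatives(query_emb, doc_emb, positive_ids, k=5):
    """Pick the top-scoring non-gold documents as hard negatives for each query."""
    scores = query_emb @ doc_emb.t()                                        # (n_queries, num_docs)
    scores[torch.arange(len(positive_ids)), positive_ids] = float("-inf")   # exclude positives
    return scores.topk(k, dim=-1).indices                                   # (n_queries, k)

hard_negs = mine_hard_negatives(query_emb, doc_emb, positive_ids)

# Contrastive loss over [positive | hard negatives] instead of random negatives.
cand_ids = torch.cat([positive_ids.unsqueeze(1), hard_negs], dim=1)   # (n_queries, 1+k)
cand_emb = doc_emb[cand_ids]                                          # (n_queries, 1+k, d)
logits = (query_emb.unsqueeze(1) * cand_emb).sum(-1)                  # (n_queries, 1+k)
loss = F.cross_entropy(logits, torch.zeros(len(logits), dtype=torch.long))
print(hard_negs.shape, float(loss))
```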
Our objective in this work is video-text retrieval, in particular a joint embedding that enables efficient text-to-video retrieval.
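The efficiency argument for a joint embedding is that video embeddings can be precomputed offline and each text query answered with a single matrix product and a ranking step; the sketch below assumes both modalities are already projected into a shared space and uses random placeholder features.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)

# Placeholder embeddings: 500 videos indexed offline, 3 text queries at search time,
# both assumed to live in the same 512-d joint space.
video_emb = F.normalize(torch.randn(500, 512), dim=-1)   # precomputed once
text_emb = F.normalize(torch.randn(3, 512), dim=-1)      # embedded per query

# Text-to-video retrieval reduces to one similarity matrix plus a top-k ranking.
sim = text_emb @ video_emb.t()            # (3, 500) cosine similarities
top5 = sim.topk(5, dim=-1).indices        # indices of the 5 best-matching videos per query
print(top5)
```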
In this paper, we propose a CLIP4Clip model to transfer the knowledge of the CLIP model to video-language retrieval in an end-to-end manner.
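A rough sketch of the general recipe follows: per-frame embeddings from an image encoder are aggregated (here by simple mean pooling, one of several possible aggregation schemes) into a video embedding that is compared against a text embedding in the same space. The linear layers below are placeholders for pretrained encoders, not the actual CLIP weights.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class FrameMeanPoolRetriever(nn.Module):
    """Video embedding = mean of per-frame image embeddings, scored against a text embedding."""
    def __init__(self, dim=512):
        super().__init__()
        # Placeholders standing in for a pretrained image encoder and text encoder.
        self.image_encoder = nn.Linear(2048, dim)
        self.text_encoder = nn.Linear(768, dim)

    def forward(self, frame_feats, text_feats):
        # frame_feats: (batch, num_frames, 2048), text_feats: (batch, 768)
        frame_emb = self.image_encoder(frame_feats)              # (batch, num_frames, dim)
        video_emb = F.normalize(frame_emb.mean(dim=1), dim=-1)   # parameter-free mean pooling
        text_emb = F.normalize(self.text_encoder(text_feats), dim=-1)
        return text_emb @ video_emb.t()                          # (batch, batch) similarity logits

model = FrameMeanPoolRetriever()
sims = model(torch.randn(4, 12, 2048), torch.randn(4, 768))  # 4 clips of 12 frames, 4 captions
print(sims.shape)  # torch.Size([4, 4]); diagonal entries are the matched pairs
```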
Most existing methods employ a transformer-based multimodal encoder to jointly model visual tokens (region-based image features) and word tokens.
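The basic shape of such an encoder is sketched below: region features and word embeddings are projected to a common width, concatenated into a single token sequence, and passed through a standard transformer encoder so that self-attention operates across both modalities. Hidden size, layer count, and vocabulary size are illustrative assumptions.

```python
import torch
import torch.nn as nn

class JointMultimodalEncoder(nn.Module):
    """Concatenate visual (region) tokens and word tokens, then encode them jointly."""
    def __init__(self, hidden=256, vocab=30522, region_dim=2048, layers=2, heads=4):
        super().__init__()
        self.word_embed = nn.Embedding(vocab, hidden)
        self.region_proj = nn.Linear(region_dim, hidden)
        layer = nn.TransformerEncoderLayer(d_model=hidden, nhead=heads, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=layers)

    def forward(self, word_ids, region_feats):
        # word_ids: (batch, n_words), region_feats: (batch, n_regions, region_dim)
        word_tokens = self.word_embed(word_ids)               # (batch, n_words, hidden)
        visual_tokens = self.region_proj(region_feats)        # (batch, n_regions, hidden)
        tokens = torch.cat([visual_tokens, word_tokens], 1)   # one joint token sequence
        return self.encoder(tokens)                           # self-attention spans both modalities

model = JointMultimodalEncoder()
out = model(torch.randint(0, 30522, (2, 8)), torch.randn(2, 36, 2048))
print(out.shape)  # torch.Size([2, 44, 256])
```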