Text-Image Retrieval

5 papers with code · Computer Vision
Subtask of Image Retrieval

Text-image retrieval comprises two tasks: (1) image as query with texts as targets, and (2) text as query with images as targets.
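As a rough illustration of the two directions, the sketch below ranks targets by cosine similarity in a shared embedding space. It assumes the image and text embeddings have already been produced by some model; the toy vectors and the helper names `cosine` and `rank` are hypothetical and do not come from any of the papers listed on this page.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def rank(query, targets):
    """Return target indices sorted from most to least similar to the query."""
    scores = [cosine(query, t) for t in targets]
    return sorted(range(len(targets)), key=lambda i: scores[i], reverse=True)

# Hypothetical toy embeddings in a shared 3-d space (made-up values).
# Image i and text i are assumed to describe the same content.
image_embeddings = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
text_embeddings = [[0.9, 0.1, 0.0], [0.1, 0.8, 0.1], [0.0, 0.2, 0.9]]

# Task 1: image as query, texts as targets.
image_to_text = [rank(img, text_embeddings) for img in image_embeddings]

# Task 2: text as query, images as targets.
text_to_image = [rank(txt, image_embeddings) for txt in text_embeddings]

print("image -> text rankings:", image_to_text)
print("text -> image rankings:", text_to_image)
```

Both tasks use the same similarity function; only the roles of query and target are swapped. This is why benchmarks for this task typically report retrieval metrics separately for each direction.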

Greatest papers with code

Deep Visual-Semantic Alignments for Generating Image Descriptions

CVPR 2015 karpathy/neuraltalk2

Our approach leverages datasets of images and their sentence descriptions to learn about the inter-modal correspondences between language and visual data.

IMAGE CAPTIONING · TEXT-IMAGE RETRIEVAL

Dual-Path Convolutional Image-Text Embedding with Instance Loss

15 Nov 2017 layumi/Image-Text-Embedding

In this paper, we propose a new system to discriminatively embed the image and text to a shared visual-textual space.

CONTENT-BASED IMAGE RETRIEVAL · CROSS-MODAL RETRIEVAL · PERSON RETRIEVAL · TEXT-IMAGE RETRIEVAL

Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks

13 Apr 2020 microsoft/Oscar

Large-scale pre-training methods of learning cross-modal representations on image-text pairs are becoming popular for vision-language tasks.

TEXT-IMAGE RETRIEVAL · VISUAL QUESTION ANSWERING

Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training

16 Aug 2019 Luka0612/ChineseVLBert

We propose Unicoder-VL, a universal encoder that aims to learn joint representations of vision and language in a pre-training manner.

LANGUAGE MODELLING · OBJECT CLASSIFICATION · TEXT-IMAGE RETRIEVAL · VISUAL COMMONSENSE REASONING