Composed Image Retrieval (CoIR)
12 papers with code • 1 benchmarks • 5 datasets
Composed Image Retrieval (CoIR) is the task involves retrieving images from a large database based on a query composed of multiple elements, such as text, images, and sketches. The goal is to develop algorithms that can understand and combine multiple sources of information to accurately retrieve images that match the query, extending the user’s expression ability.
Most implemented papers
CoVR: Learning Composed Video Retrieval from Web Video Captions
Most CoIR approaches require manually annotated datasets, comprising image-text-image triplets, where the text describes a modification from the query image to the target image.
Sentence-level Prompts Benefit Composed Image Retrieval
Composed image retrieval (CIR) is the task of retrieving specific images by using a query that involves both a reference image and a relative caption.