Multi-Instance Retrieval

8 papers with code • 1 benchmarks • 1 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Multi-Instance Retrieval models and implementations
2 papers
205

Most implemented papers

Learning video retrieval models with relevance-aware online mining

aranciokov/ranp 16 Mar 2022

Due to the amount of videos and related captions uploaded every hour, deep learning-based solutions for cross-modal video retrieval are attracting more and more attention.

Egocentric Video-Language Pretraining

showlab/egovlp 3 Jun 2022

Video-Language Pretraining (VLP), which aims to learn transferable representation to advance a wide range of video-text downstream tasks, has recently received increasing attention.

Learning Video Representations from Large Language Models

facebookresearch/lavila CVPR 2023

We introduce LaViLa, a new approach to learning video-language representations by leveraging Large Language Models (LLMs).

Relevance-based Margin for Contrastively-trained Video Retrieval Models

aranciokov/relevancemargin-icmr22 27 Apr 2022

We show that even if we carefully tuned the fixed margin, our technique (which does not have the margin as a hyper-parameter) would still achieve better performance.

Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022

showlab/egovlp 4 Jul 2022

In this report, we propose a video-language pretraining (VLP) based solution \cite{kevin2022egovlp} for the EPIC-KITCHENS-100 Multi-Instance Retrieval (MIR) challenge.

EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone

facebookresearch/EgoVLPv2 ICCV 2023

Video-language pre-training (VLP) has become increasingly important due to its ability to generalize to various vision and language tasks.

Training a Large Video Model on a Single Machine in a Day

zhaoyue-zephyrus/avion 28 Sep 2023

Videos are big, complex to pre-process, and slow to train on.