Multi-Instance Retrieval
13 papers with code • 1 benchmark • 1 dataset
Most implemented papers
Learning Video Representations from Large Language Models
We introduce LaViLa, a new approach to learning video-language representations by leveraging Large Language Models (LLMs).
Learning video retrieval models with relevance-aware online mining
Due to the amount of videos and related captions uploaded every hour, deep learning-based solutions for cross-modal video retrieval are attracting more and more attention.
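The technique named in this entry, relevance-aware online mining, decides which in-batch pairs should act as positives and negatives based on how semantically relevant their captions are to the query video. The sketch below is only a rough illustration of that idea under a symmetric InfoNCE-style loss, where overly relevant off-diagonal pairs are excluded from the negative set; the relevance matrix, threshold, and function names are hypothetical stand-ins, not the paper's implementation.

```python
import torch
import torch.nn.functional as F

def contrastive_loss_with_relevance_masking(video_emb, text_emb, relevance,
                                            temperature=0.07, rel_threshold=0.5):
    """Symmetric InfoNCE-style loss that ignores in-batch 'negatives' whose
    caption is highly relevant to the video (hypothetical sketch).

    video_emb, text_emb: (B, D) L2-normalized embeddings, paired by index.
    relevance:           (B, B) precomputed video-caption relevance in [0, 1],
                         e.g. noun/verb overlap between narrations.
    """
    logits = video_emb @ text_emb.t() / temperature            # (B, B)
    targets = torch.arange(logits.size(0), device=logits.device)

    # Mask out off-diagonal pairs that are too relevant to be true negatives.
    too_relevant = (relevance > rel_threshold) & ~torch.eye(
        logits.size(0), dtype=torch.bool, device=logits.device)
    logits = logits.masked_fill(too_relevant, float('-inf'))

    loss_v2t = F.cross_entropy(logits, targets)
    loss_t2v = F.cross_entropy(logits.t(), targets)
    return 0.5 * (loss_v2t + loss_t2v)
```

A natural variant, closer in spirit to relevance-aware mining, is to promote such highly relevant pairs to extra positives rather than merely masking them out of the negative set.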
Egocentric Video-Language Pretraining
Video-Language Pretraining (VLP), which aims to learn transferable representation to advance a wide range of video-text downstream tasks, has recently received increasing attention.
Relevance-based Margin for Contrastively-trained Video Retrieval Models
We show that even if we carefully tuned the fixed margin, our technique (which does not have the margin as a hyper-parameter) would still achieve better performance.
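Below is a minimal sketch of how a relevance-derived margin can replace a fixed, hand-tuned one in a triplet-style retrieval loss: the margin is computed from the relevance gap between the positive and the sampled negative caption, so there is no margin hyper-parameter to tune. All names are illustrative and the exact formulation in the paper may differ.

```python
import torch.nn.functional as F

def relevance_scaled_triplet_loss(video_emb, pos_text_emb, neg_text_emb,
                                  pos_rel, neg_rel):
    """Triplet loss whose margin shrinks when the negative caption is almost
    as relevant as the positive one (illustrative sketch only).

    video_emb, pos_text_emb, neg_text_emb: (B, D) embeddings.
    pos_rel, neg_rel: (B,) relevance scores in [0, 1] of the positive and
                      negative captions with respect to each video.
    """
    pos_sim = F.cosine_similarity(video_emb, pos_text_emb, dim=-1)
    neg_sim = F.cosine_similarity(video_emb, neg_text_emb, dim=-1)
    # The margin is the relevance gap itself: a caption nearly as relevant
    # as the positive should not be pushed far away.
    margin = (pos_rel - neg_rel).clamp(min=0.0)
    return F.relu(neg_sim - pos_sim + margin).mean()
```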
Exploiting Semantic Role Contextualized Video Features for Multi-Instance Text-Video Retrieval EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022
In this report, we present our approach for EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022.
Egocentric Video-Language Pretraining @ EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2022
In this report, we propose a video-language pretraining (VLP) based solution, built on EgoVLP, for the EPIC-KITCHENS-100 Multi-Instance Retrieval (MIR) challenge.
HierVL: Learning Hierarchical Video-Language Embeddings
Video-language embeddings are a promising avenue for injecting semantics into visual representations, but existing methods capture only short-term associations between seconds-long video clips and their accompanying text.
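A rough sketch of the hierarchical idea follows, assuming clip embeddings are mean-pooled into a long-term video embedding that is contrasted with a video-level text summary, while individual clips are contrasted with their own narrations; HierVL's actual aggregation and training schedule are more elaborate than this illustration.

```python
import torch
import torch.nn.functional as F

def info_nce(a, b, temperature=0.07):
    """Symmetric InfoNCE over a batch of paired embeddings, both (B, D)."""
    logits = F.normalize(a, dim=-1) @ F.normalize(b, dim=-1).t() / temperature
    targets = torch.arange(a.size(0), device=a.device)
    return 0.5 * (F.cross_entropy(logits, targets) +
                  F.cross_entropy(logits.t(), targets))

def hierarchical_loss(clip_emb, narration_emb, summary_emb, parent_weight=1.0):
    """Two-level objective: clips match their narrations, and the aggregated
    long-term video embedding matches a summary text (illustrative sketch).

    clip_emb:      (B, T, D) embeddings of T clips per video.
    narration_emb: (B, T, D) embeddings of the clip-level narrations.
    summary_emb:   (B, D) embedding of a video-level text summary.
    """
    B, T, D = clip_emb.shape
    # Child level: every clip should retrieve its own short narration.
    child = info_nce(clip_emb.reshape(B * T, D), narration_emb.reshape(B * T, D))
    # Parent level: mean-pool clips into a long-term video representation.
    video_emb = clip_emb.mean(dim=1)
    parent = info_nce(video_emb, summary_emb)
    return child + parent_weight * parent
```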
EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone
Video-language pre-training (VLP) has become increasingly important due to its ability to generalize to various vision and language tasks.
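"Fusion in the backbone" refers to inserting cross-modal attention directly into the uni-modal encoder layers, so the same backbone can run either as a standalone encoder (for fast retrieval) or as a fusion encoder (for joint video-text reasoning). The block below is a hypothetical illustration of that switchable pattern, not EgoVLPv2's actual module.

```python
import torch.nn as nn

class FusionBlock(nn.Module):
    """Transformer block with optional cross-attention to the other modality,
    so the backbone can act as a uni-modal encoder (fuse=False) or a fusion
    encoder (fuse=True). Illustrative sketch only."""

    def __init__(self, dim=768, heads=12):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.cross_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.mlp = nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(),
                                 nn.Linear(4 * dim, dim))
        self.norm1 = nn.LayerNorm(dim)
        self.norm2 = nn.LayerNorm(dim)
        self.norm3 = nn.LayerNorm(dim)

    def forward(self, x, other=None, fuse=False):
        # Standard self-attention over this modality's tokens.
        h = self.norm1(x)
        x = x + self.self_attn(h, h, h, need_weights=False)[0]
        # Cross-attention to the other modality only when fusion is enabled.
        if fuse and other is not None:
            h = self.norm2(x)
            x = x + self.cross_attn(h, other, other, need_weights=False)[0]
        return x + self.mlp(self.norm3(x))
```

In retrieval mode one would call block(video_tokens) alone; in fusion mode, block(video_tokens, other=text_tokens, fuse=True).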
Training a Large Video Model on a Single Machine in a Day
Videos are big, complex to pre-process, and slow to train on.
EgoNCE++: Do Egocentric Video-Language Models Really Understand Hand-Object Interactions?
Due to the occurrence of diverse EgoHOIs in the real world, we propose an open-vocabulary benchmark named EgoHOIBench to reveal the diminished performance of current egocentric video-language models (EgoVLMs) on fine-grained concepts, indicating that these models still lack a full spectrum of egocentric understanding.
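A common way to probe this kind of fine-grained hand-object understanding is to build hard negative captions that differ from the ground-truth caption only in an HOI-relevant word (the verb or the object noun) and check whether the model still ranks the true caption first. The sketch below is a hypothetical illustration of that recipe; EgoHOIBench's construction and the EgoNCE++ training objective are more involved and are described in the paper.

```python
import torch
import torch.nn.functional as F

def hoi_negative_captions(caption, verb, alt_verbs):
    """Build hard negatives by swapping only the verb in the caption
    (hypothetical helper; real benchmarks also perturb object nouns)."""
    return [caption.replace(verb, v, 1) for v in alt_verbs if v != verb]

def ranks_true_caption_first(video_emb, text_encoder, caption, negatives):
    """Return True if the video embedding scores the true caption above all
    of its word-level negatives. `text_encoder` maps a list of strings to
    (N, D) embeddings; it is a stand-in for an actual EgoVLM text tower."""
    texts = [caption] + negatives
    text_emb = F.normalize(text_encoder(texts), dim=-1)   # (N, D)
    video_emb = F.normalize(video_emb, dim=-1)             # (D,)
    scores = text_emb @ video_emb                          # (N,)
    return bool(torch.argmax(scores) == 0)
```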