Learning a Text-Video Embedding from Incomplete and Heterogeneous Data

7 Apr 2018 Antoine Miech Ivan Laptev Josef Sivic

Joint understanding of video and language is an active research area with many applications. Prior work in this domain typically relies on learning text-video embeddings... (read more)

PDF Abstract

Results from the Paper


Ranked #4 on Video Retrieval on LSMDC (using extra training data)

     Get a GitHub badge
TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK USES EXTRA
TRAINING DATA
RESULT BENCHMARK
Video Retrieval LSMDC MoEE text-to-video R@1 10.1 # 4
text-to-video R@5 25.6 # 4
text-to-video R@10 34.6 # 4
text-to-video Median Rank 27 # 4

Methods used in the Paper


METHOD TYPE
🤖 No Methods Found Help the community by adding them if they're not listed; e.g. Deep Residual Learning for Image Recognition uses ResNet