Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning

6 Mar 2020Elad AmraniRami Ben-AriDaniel RotmanAlex Bronstein

One of the key factors of enabling machine learning models to comprehend and solve real-world tasks is to leverage multimodal data. Unfortunately, annotation of multimodal data is challenging and expensive... (read more)

PDF Abstract

Results from the Paper


TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK RESULT LEADERBOARD
Visual Question Answering MSRVTT-QA SSML Accuracy 0.35 # 2
Video Retrieval MSVD SSML text-to-video [email protected] 20.3 # 1
text-to-video [email protected] 49.0 # 1
text-to-video [email protected] 63.3 # 2
text-to-video Median Rank 6.0 # 1
text-to-video [email protected] -- # 2
text-to-video Mean Rank -- # 2
Visual Question Answering MSVD-QA SSML Accuracy 0.351 # 2