Bilinear Attention Networks

NeurIPS 2018 · Jin-Hwa Kim, Jaehyun Jun, Byoung-Tak Zhang

Attention networks in multimodal learning provide an efficient way to utilize given visual information selectively. However, learning attention distributions for every pair of multimodal input channels is prohibitively expensive...

Evaluation Results from the Paper


Task                       Dataset  Model              Metric    Value   Global rank
Visual Question Answering  VQA v2   BAN+Glove+Counter  Accuracy  70.04%  #4