1 code implementation • 25 May 2021 • Yang Li, Zinc Zhang, Hutchin Huang
In this paper, we utilize the multi-modal pre-trained models ViLBERT and VisualBERT.
Data Augmentation