Multimodal Text and Image Classification
5 papers with code • 3 benchmarks • 4 datasets
Classification that uses both an image and its accompanying text as input
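As a rough illustration of the task, the sketch below fuses an image feature vector and a text feature vector by concatenation and scores the result with a linear classifier. It is not taken from any of the listed papers. The function names (`fuse`, `softmax`, `classify`), the feature values, and the weights are all hypothetical. In practice the features would come from trained image and text encoders.

```python
import math

def fuse(image_features, text_features):
    """Concatenate the two modality vectors (simple feature-level fusion)."""
    return list(image_features) + list(text_features)

def softmax(scores):
    """Turn raw class scores into probabilities (shifted for numerical stability)."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def classify(image_features, text_features, weights, biases):
    """Apply one linear layer to the fused vector and return (label, probabilities)."""
    fused = fuse(image_features, text_features)
    scores = [
        sum(w * x for w, x in zip(row, fused)) + b
        for row, b in zip(weights, biases)
    ]
    probs = softmax(scores)
    label = max(range(len(probs)), key=probs.__getitem__)
    return label, probs

# Hypothetical toy inputs: 2 image features, 3 text features, 2 classes.
image_features = [0.2, 0.9]
text_features = [0.7, 0.1, 0.4]
weights = [
    [1.0, 0.0, 0.5, 0.0, 0.0],  # class 0
    [0.0, 1.0, 0.0, 0.5, 0.5],  # class 1
]
biases = [0.0, 0.0]

label, probs = classify(image_features, text_features, weights, biases)
print(label, probs)
```

Concatenation is only one fusion strategy. It is used here because it is the simplest way to let a single classifier see both modalities at once.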
In recent years, natural language descriptions have been used to obtain information about the discriminative parts of an object.
Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge
Memes on the Internet are often harmless and sometimes amusing.
In this paper, we propose Harmonic-NAS, a framework for the joint optimization of unimodal backbones and multimodal fusion networks with hardware awareness on resource-constrained devices.