Multimodal Text and Image Classification
5 papers with code • 3 benchmarks • 4 datasets
Classification with both source Image and Text
Most implemented papers
Are These Birds Similar: Learning Branched Networks for Fine-grained Representations
In recent years, natural language descriptions are used to obtain information on discriminative parts of the object.
Analysis of Social Media Data using Multimodal Deep Learning for Disaster Response
Multimedia content in social media platforms provides significant information during disaster events.
Image and Text fusion for UPMC Food-101 \\using BERT and CNNs
The modern digital world is becoming more and more multimodal.
Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge
Memes on the Internet are often harmless and sometimes amusing.
Harmonic-NAS: Hardware-Aware Multimodal Neural Architecture Search on Resource-constrained Devices
In this paper, we propose Harmonic-NAS, a framework for the joint optimization of unimodal backbones and multimodal fusion networks with hardware awareness on resource-constrained devices.