Multimodal Sentiment Analysis

72 papers with code • 5 benchmarks • 7 datasets

Multimodal sentiment analysis is the task of performing sentiment analysis with multiple data sources - e.g. a camera feed of someone's face and their recorded speech.

( Image credit: ICON: Interactive Conversational Memory Network for Multimodal Emotion Detection )

Benchmarks

Add a Result

These leaderboards are used to track progress in Multimodal Sentiment Analysis

Dataset	Best Model	Compare
CMU-MOSEI	SeMUL-PCD	See all
MOSI	SPECTRA	See all
CMU-MOSI	UniMSE	See all
B-T4SA	AutoML-Based Fusion Approach	See all
CH-SIMS	MMML	See all

Libraries

Use these libraries to find Multimodal Sentiment Analysis models and implementations

thuiar/MMSA

3 papers

566

Datasets

Most implemented papers

Most implemented Social Latest No code

Multimodal Speech Emotion Recognition Using Audio and Text

david-yoon/multimodal-speech-emotion • • 10 Oct 2018

Speech emotion recognition is a challenging task, and extensive reliance has been placed on models that use audio features in building well-performing classifiers.

Paper
Code

Words Can Shift: Dynamically Adjusting Word Representations Using Nonverbal Behaviors

victorywys/RAVEN • • 23 Nov 2018

Humans convey their intentions through the usage of both verbal and nonverbal behaviors during face-to-face communication.

Paper
Code

Multimodal Transformer for Unaligned Multimodal Language Sequences

yaohungt/Multimodal-Transformer • • ACL 2019

Human language is often multimodal, which comprehends a mixture of natural language, facial gestures, and acoustic behaviors.

Paper
Code

Efficient Low-rank Multimodal Fusion with Modality-Specific Factors

Justin1904/Low-rank-Multimodal-Fusion • • ACL 2018

Previous research in this field has exploited the expressiveness of tensors for multimodal representation.

Paper
Code

Complementary Fusion of Multi-Features and Multi-Modalities in Sentiment Analysis

robertjkeck2/EmoTe • 17 Apr 2019

Therefore, in this paper, based on audio and text, we consider the task of multimodal sentiment analysis and propose a novel fusion strategy including both multi-feature fusion and multi-modality fusion to improve the accuracy of audio-text sentiment analysis.

Paper
Code

M-SENA: An Integrated Platform for Multimodal Sentiment Analysis

thuiar/MMSA • • ACL 2022

The platform features a fully modular video sentiment analysis framework consisting of data management, feature extraction, model training, and result analysis modules.

Paper
Code