Speech Emotion Recognition

54 papers with code • 6 benchmarks • 7 datasets

This task has no description! Would you like to contribute one?

Libraries

Use these libraries to find Speech Emotion Recognition models and implementations

Most implemented papers

Continuous control with deep reinforcement learning

hill-a/stable-baselines 9 Sep 2015

We adapt the ideas underlying the success of Deep Q-Learning to the continuous action domain.

Multimodal Speech Emotion Recognition and Ambiguity Resolution

Demfier/multimodal-speech-emotion-recognition 12 Apr 2019

In this work, we adopt a feature-engineering based approach to tackle the task of speech emotion recognition.

Multimodal Speech Emotion Recognition Using Audio and Text

david-yoon/multimodal-speech-emotion 10 Oct 2018

Speech emotion recognition is a challenging task, and extensive reliance has been placed on models that use audio features in building well-performing classifiers.

Speech Emotion Recognition Using Multi-hop Attention Mechanism

warnikchow/coaudiotext 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2019

As opposed to using knowledge from both the modalities separately, we propose a framework to exploit acoustic information in tandem with lexical data.

Deep Learning based Emotion Recognition System Using Speech Features and Transcriptions

MagnusXu/Speech-Emotion-Recognition-Capstone-Project 11 Jun 2019

This paper proposes a speech emotion recognition method based on speech features and speech transcriptions (text).

Seen and Unseen emotional style transfer for voice conversion with a new emotional speech dataset

HLTSingapore/Emotional-Speech-Data 28 Oct 2020

Emotional voice conversion aims to transform emotional prosody in speech while preserving the linguistic content and speaker identity.

AST: Audio Spectrogram Transformer

YuanGongND/ast 5 Apr 2021

In the past decade, convolutional neural networks (CNNs) have been widely adopted as the main building block for end-to-end audio classification models, which aim to learn a direct mapping from audio spectrograms to corresponding labels.

Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings

habla-liaa/ser-with-w2v2 8 Apr 2021

Emotion recognition datasets are relatively small, making the use of the more sophisticated deep learning approaches challenging.

SERAB: A multi-lingual benchmark for speech emotion recognition

neclow/serab 7 Oct 2021

To facilitate the process, here, we present the Speech Emotion Recognition Adaptation Benchmark (SERAB), a framework for evaluating the performance and generalization capacity of different approaches for utterance-level SER.

Transfer Learning for Improving Speech Emotion Classification Accuracy

raulsteleac/Speech_Emotion_Recognition 19 Jan 2018

The majority of existing speech emotion recognition research focuses on automatic emotion detection using training and testing data from same corpus collected under the same conditions.