Search Results for author: Björn Schuller

Found 56 papers, 21 papers with code

End-to-End Multimodal Emotion Recognition using Deep Neural Networks

2 code implementations • 27 Apr 2017 • Panagiotis Tzirakis, George Trigeorgis, Mihalis A. Nicolaou, Björn Schuller, Stefanos Zafeiriou

The system is then trained in an end-to-end fashion where - by also taking advantage of the correlations of the each of the streams - we manage to significantly outperform the traditional approaches based on auditory and visual handcrafted features for the prediction of spontaneous and natural emotions on the RECOLA database of the AVEC 2016 research challenge on emotion recognition.

Multimodal Emotion Recognition Retrieval

213

Paper
Code

Applying Cooperative Machine Learning to Speed Up the Annotation of Social Signals in Large Multi-modal Corpora

1 code implementation • 7 Feb 2018 • Johannes Wagner, Tobias Baur, Yue Zhang, Michel F. Valstar, Björn Schuller, Elisabeth André

Scientific disciplines, such as Behavioural Psychology, Anthropology and recently Social Signal Processing are concerned with the systematic exploration of human behaviour.

170

Paper
Code

auDeep: Unsupervised Learning of Representations from Audio with Deep Recurrent Neural Networks

1 code implementation • 12 Dec 2017 • Michael Freitag, Shahin Amiriparian, Sergey Pugachevskiy, NIcholas Cummins, Björn Schuller

auDeep is a Python toolkit for deep unsupervised representation learning from acoustic data.

Sound Audio and Speech Processing

147

Paper
Code

Deep Affect Prediction in-the-wild: Aff-Wild Database and Challenge, Deep Architectures, and Beyond

1 code implementation • 29 Apr 2018 • Dimitrios Kollias, Panagiotis Tzirakis, Mihalis A. Nicolaou, Athanasios Papaioannou, Guoying Zhao, Björn Schuller, Irene Kotsia, Stefanos Zafeiriou

Automatic understanding of human affect using visual signals is of great importance in everyday human-machine interactions.

Emotion Recognition

Paper
Code

Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning

1 code implementation • 15 Jun 2022 • Rui Liu, Berrak Sisman, Björn Schuller, Guanglai Gao, Haizhou Li

In this paper, we propose a data-driven deep learning model, i. e. StrengthNet, to improve the generalization of emotion strength assessment for seen and unseen speech.

Attribute Emotion Classification +2

Paper
Code

N-HANS: Introducing the Augsburg Neuro-Holistic Audio-eNhancement System

1 code implementation • 16 Nov 2019 • Shuo Liu, Gil Keren, Björn Schuller

N-HANS is a Python toolkit for in-the-wild audio enhancement, including speech, music, and general audio denoising, separation, and selective noise or source suppression.

Sound Audio and Speech Processing

Paper
Code

EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation

2 code implementations • 6 Sep 2023 • Nikolai Körber, Eduard Kromer, Andreas Siebert, Sascha Hauke, Daniel Mueller-Gritschneder, Björn Schuller

We introduce EGIC, an enhanced generative image compression method that allows traversing the distortion-perception curve efficiently from a single model.

Image Compression Semantic Segmentation

Paper
Code

Convolutional RNN: an Enhanced Model for Extracting Features from Sequential Data

3 code implementations • 18 Feb 2016 • Gil Keren, Björn Schuller

Traditional convolutional layers extract features from patches of data by applying a non-linearity on an affine function of the input.

Audio Classification

Paper
Code

EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition

1 code implementation • 10 Mar 2021 • Maurice Gerczuk, Shahin Amiriparian, Sandra Ottl, Björn Schuller

The corpus is then utilised to create a novel framework for multi-corpus speech emotion recognition, namely EmoNet.

Speech Emotion Recognition Transfer Learning

Paper
Code

The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts

2 code implementations • 3 May 2022 • Alice Baird, Panagiotis Tzirakis, Gauthier Gidel, Marco Jiralerspong, Eilif B. Muller, Kory Mathewson, Björn Schuller, Erik Cambria, Dacher Keltner, Alan Cowen

ExVo 2022, includes three competition tracks using a large-scale dataset of 59, 201 vocalizations from 1, 702 speakers.

Few-Shot Learning

Paper
Code

The ACII 2022 Affective Vocal Bursts Workshop & Competition: Understanding a critically understudied modality of emotional expression

3 code implementations • 7 Jul 2022 • Alice Baird, Panagiotis Tzirakis, Jeffrey A. Brooks, Christopher B. Gregory, Björn Schuller, Anton Batliner, Dacher Keltner, Alan Cowen

The ACII Affective Vocal Bursts Workshop & Competition is focused on understanding multiple affective dimensions of vocal bursts: laughs, gasps, cries, screams, and many other non-linguistic vocalizations central to the expression of emotion and to human communication more generally.

A-VB Culture A-VB High +2

Paper
Code

Nkululeko: A Tool For Rapid Speaker Characteristics Detection

1 code implementation • LREC 2022 • Felix Burkhardt, Johannes Wagner, Hagen Wierstorf, Florian Eyben, Björn Schuller

We present advancements with a software tool called Nkululeko, that lets users perform (semi-) supervised machine learning experiments in the speaker characteristics domain.

Emotion Classification regression

Paper
Code

Poisson CNN: Convolutional neural networks for the solution of the Poisson equation on a Cartesian mesh

1 code implementation • 18 Oct 2019 • Ali Girayhan Özbay, Arash Hamzehloo, Sylvain Laizet, Panagiotis Tzirakis, Georgios Rizos, Björn Schuller

The Poisson equation is commonly encountered in engineering, for instance in computational fluid dynamics (CFD) where it is needed to compute corrections to the pressure field to ensure the incompressibility of the velocity field.

Paper
Code

Calibrated Prediction Intervals for Neural Network Regressors

1 code implementation • 26 Mar 2018 • Gil Keren, NIcholas Cummins, Björn Schuller

Despite their obvious aforementioned advantage in relation to accuracy, contemporary neural networks can, generally, be regarded as poorly calibrated and as such do not produce reliable output probability estimates.

Prediction Intervals

Paper
Code

A Machine Learning Framework for Automatic Prediction of Human Semen Motility

1 code implementation • 16 Sep 2021 • Sandra Ottl, Shahin Amiriparian, Maurice Gerczuk, Björn Schuller

Finally, a linear SVR is trained on this feature representation.

BIG-bench Machine Learning

Paper
Code

Personalized Federated Deep Learning for Pain Estimation From Face Images

1 code implementation • 12 Jan 2021 • Ognjen Rudovic, Nicolas Tobis, Sebastian Kaltwang, Björn Schuller, Daniel Rueckert, Jeffrey F. Cohn, Rosalind W. Picard

A potential approach to tackling this is Federated Learning (FL), which enables multiple parties to collaboratively learn a shared prediction model by using parameters of locally trained models while keeping raw training data locally.

Federated Learning

Paper
Code

Fast Single-Class Classification and the Principle of Logit Separation

2 code implementations • 29 May 2017 • Gil Keren, Sivan Sabato, Björn Schuller

Our experiments show that indeed in almost all cases, losses that are aligned with the Principle of Logit Separation obtain at least 20% relative accuracy improvement in the SLC task compared to losses that are not aligned with it, and sometimes considerably more.

Binary Classification Classification +2

Paper
Code

The Many-to-Many Mapping Between the Concordance Correlation Coefficient and the Mean Square Error

1 code implementation • 14 Feb 2019 • Vedhas Pandit, Björn Schuller

Despite its drawbacks, $MSE$ is one of the most popular performance metrics (and a loss function); along with lately $\rho_c$ in many of the sequence prediction challenges.

Sentiment Analysis Time Series Analysis

Paper
Code

Data Augmentation for Dementia Detection in Spoken Language

1 code implementation • 26 Jun 2022 • Anna Hlédiková, Dominika Woszczyk, Alican Akman, Soteris Demetriou, Björn Schuller

In this work, we investigate data augmentation techniques for the task of AD detection and perform an empirical evaluation of the different approaches on two kinds of models for both the text and audio domains.

Data Augmentation

Paper
Code

Noise Invariant Frame Selection: A Simple Method to Address the Background Noise Problem for Text-independent Speaker Verification

1 code implementation • 3 May 2018 • Siyang Song, Shuimei Zhang, Björn Schuller, Linlin Shen, Michel Valstar

The performance of speaker-related systems usually degrades heavily in practical applications largely due to the presence of background noise.

Quantization Text-Independent Speaker Verification

Paper
Code

An Enhanced Adversarial Network with Combined Latent Features for Spatio-Temporal Facial Affect Estimation in the Wild

1 code implementation • 18 Feb 2021 • Decky Aspandi, Federico Sukno, Björn Schuller, Xavier Binefa

This paper addresses these shortcomings by proposing a novel model that efficiently extracts both spatial and temporal features of the data by means of its enhanced temporal modelling based on latent features.

Paper
Code

audEERING's approach to the One-Minute-Gradual Emotion Challenge

no code implementations • 3 May 2018 • Andreas Triantafyllopoulos, Hesam Sagha, Florian Eyben, Björn Schuller

This paper describes audEERING's submissions as well as additional evaluations for the One-Minute-Gradual (OMG) emotion recognition challenge.

Emotion Recognition

Paper
Add Code

Weakly Supervised One-Shot Detection with Attention Similarity Networks

no code implementations • 10 Jan 2018 • Gil Keren, Maximilian Schmitt, Thomas Kehrenberg, Björn Schuller

Neural network models that are not conditioned on class identities were shown to facilitate knowledge transfer between classes and to be well-suited for one-shot learning tasks.

One-Shot Learning Transfer Learning

Paper
Add Code

Learning audio sequence representations for acoustic event classification

no code implementations • 27 Jul 2017 • Zixing Zhang, Ding Liu, Jing Han, Kun Qian, Björn Schuller

Extensive evaluation on a large-size acoustic event database is performed, and the empirical results demonstrate that the learnt audio sequence representation yields a significant performance improvement by a large margin compared with other state-of-the-art hand-crafted sequence features for AEC.

Classification General Classification

Paper
Add Code

Deep Learning for Environmentally Robust Speech Recognition: An Overview of Recent Developments

no code implementations • 30 May 2017 • Zixing Zhang, Jürgen Geiger, Jouni Pohjalainen, Amr El-Desoky Mousa, Wenyu Jin, Björn Schuller

Eliminating the negative effect of non-stationary environmental noise is a long-standing research topic for automatic speech recognition that stills remains an important challenge.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Deep Structured Learning for Facial Action Unit Intensity Estimation

no code implementations • CVPR 2017 • Robert Walecki, Ognjen, Rudovic, Vladimir Pavlovic, Björn Schuller, Maja Pantic

The goal of this paper is to model these structures and estimate complex feature representations simultaneously by combining conditional random field (CRF) encoded AU dependencies with deep learning.

Paper
Add Code

Tunable Sensitivity to Large Errors in Neural Network Training

no code implementations • 23 Nov 2016 • Gil Keren, Sivan Sabato, Björn Schuller

We propose incorporating this idea of tunable sensitivity for hard examples in neural network learning, using a new generalization of the cross-entropy gradient step, which can be used in place of the gradient in any gradient-based training method.

Paper
Add Code

Detecting Road Surface Wetness from Audio: A Deep Learning Approach

no code implementations • 22 Nov 2015 • Irman Abdić, Lex Fridman, Erik Marchi, Daniel E. Brown, William Angell, Bryan Reimer, Björn Schuller

We introduce a recurrent neural network architecture for automated road surface wetness detection from audio of tire-surface interaction.

General Classification

Paper
Add Code

A Broadcast News Corpus for Evaluation and Tuning of German LVCSR Systems

no code implementations • 15 Dec 2014 • Felix Weninger, Björn Schuller, Florian Eyben, Martin Wöllmer, Gerhard Rigoll

Transcription of broadcast news is an interesting and challenging application for large-vocabulary continuous speech recognition (LVCSR).

speech-recognition Speech Recognition

Paper
Add Code

Acoustic Gait-based Person Identification using Hidden Markov Models

no code implementations • 11 Jun 2014 • Jürgen T. Geiger, Maximilian Kneißl, Björn Schuller, Gerhard Rigoll

The goal of the system is to analyse sounds emitted by walking persons (mostly the step sounds) and identify those persons.

Gait Recognition Person Identification

Paper
Add Code

The state of play of ASC-Inclusion: An Integrated Internet-Based Environment for Social Inclusion of Children with Autism Spectrum Conditions

no code implementations • 24 Mar 2014 • Björn Schuller, Erik Marchi, Simon Baron-Cohen, Helen O'Reilly, Delia Pigat, Peter Robinson, Ian Daves

Individuals with Autism Spectrum Conditions (ASC) have marked difficulties using verbal and non-verbal communication for social interaction.

Paper
Add Code

6th International Symposium on Attention in Cognitive Systems 2013

no code implementations • 22 Jul 2013 • Lucas Paletta, Laurent Itti, Björn Schuller, Fang Fang

This volume contains the papers accepted at the 6th International Symposium on Attention in Cognitive Systems (ISACS 2013), held in Beijing, August 5, 2013.

Paper
Add Code

Adversarial Training in Affective Computing and Sentiment Analysis: Recent Advances and Perspectives

no code implementations • 21 Sep 2018 • Jing Han, Zixing Zhang, NIcholas Cummins, Björn Schuller

Over the past few years, adversarial training has become an extremely active research topic and has been successfully applied to various Artificial Intelligence (AI) domains.

Sentiment Analysis

Paper
Add Code

Scaling Speech Enhancement in Unseen Environments with Noise Embeddings

no code implementations • 26 Oct 2018 • Gil Keren, Jing Han, Björn Schuller

We address the problem of speech enhancement generalisation to unseen environments by performing two manipulations.

Speech Enhancement speech-recognition +1

Paper
Add Code

The Principle of Logit Separation

no code implementations • ICLR 2018 • Gil Keren, Sivan Sabato, Björn Schuller

In contrast, there are known loss functions, as well as novel batch loss functions that we propose, which are aligned with this principle.

Image Retrieval

Paper
Add Code

Voice command generation using Progressive Wavegans

no code implementations • 13 Mar 2019 • Thomas Wiest, NIcholas Cummins, Alice Baird, Simone Hantke, Judith Dineley, Björn Schuller

Generative Adversarial Networks (GANs) have become exceedingly popular in a wide range of data-driven research fields, due in part to their success in image generation.

Audio Generation Image Generation

Paper
Add Code

Synthesising 3D Facial Motion from "In-the-Wild" Speech

no code implementations • 15 Apr 2019 • Panagiotis Tzirakis, Athanasios Papaioannou, Alexander Lattas, Michail Tarasiou, Björn Schuller, Stefanos Zafeiriou

Synthesising 3D facial motion from speech is a crucial problem manifesting in a multitude of applications such as computer games and movies.

Lip Reading Motion Synthesis

Paper
Add Code

Single-Channel Speech Separation with Auxiliary Speaker Embeddings

no code implementations • 24 Jun 2019 • Shuo Liu, Gil Keren, Björn Schuller

We present a novel source separation model to decompose asingle-channel speech signal into two speech segments belonging to two different speakers.

Speech Separation

Paper
Add Code

EmoBed: Strengthening Monomodal Emotion Recognition via Training with Crossmodal Emotion Embeddings

no code implementations • 23 Jul 2019 • Jing Han, Zixing Zhang, Zhao Ren, Björn Schuller

Motivated by this, we propose a novel crossmodal emotion embedding framework called EmoBed, which aims to leverage the knowledge from other auxiliary modalities to improve the performance of an emotion recognition system at hand.

Emotion Classification Emotion Recognition

Paper
Add Code

AVEC 2019 Workshop and Challenge: State-of-Mind, Detecting Depression with AI, and Cross-Cultural Affect Recognition

no code implementations • 10 Jul 2019 • Fabien Ringeval, Björn Schuller, Michel Valstar, NIcholas Cummins, Roddy Cowie, Leili Tavabi, Maximilian Schmitt, Sina Alisamir, Shahin Amiriparian, Eva-Maria Messner, Siyang Song, Shuo Liu, Ziping Zhao, Adria Mallol-Ragolta, Zhao Ren, Mohammad Soleymani, Maja Pantic

The Audio/Visual Emotion Challenge and Workshop (AVEC 2019) "State-of-Mind, Detecting Depression with AI, and Cross-cultural Affect Recognition" is the ninth competition event aimed at the comparison of multimedia processing and machine learning methods for automatic audiovisual health and emotion analysis, with all participants competing strictly under the same conditions.

Emotion Recognition

Paper
Add Code

On Laughter and Speech-Laugh, Based on Observations of Child-Robot Interaction

no code implementations • 30 Aug 2019 • Anton Batliner, Stefan Steidl, Florian Eyben, Björn Schuller

In this article, we study laughter found in child-robot interaction where it had not been prompted intentionally.

Descriptive feature selection +2

Paper
Add Code

Adversarial-based neural networks for affect estimations in the wild

no code implementations • 3 Feb 2020 • Decky Aspandi, Adria Mallol-Ragolta, Björn Schuller, Xavier Binefa

However, the use of latent features, which is feasible through adversarial learning, is not largely explored, yet.

Paper
Add Code

Guided Generative Adversarial Neural Network for Representation Learning and High Fidelity Audio Generation using Fewer Labelled Audio Data

no code implementations • 5 Mar 2020 • Kazi Nazmul Haque, Rajib Rana, John H. L. Hansen, Björn Schuller

However, the model can become redundant if it is intended for a specific task.

Audio Generation Emotion Recognition +2

Paper
Add Code

Cross-lingual Zero- and Few-shot Hate Speech Detection Utilising Frozen Transformer Language Models and AXEL

no code implementations • 13 Apr 2020 • Lukas Stappen, Fabian Brunn, Björn Schuller

Detecting hate speech, especially in low-resource languages, is a non-trivial challenge.

Few-Shot Learning General Classification +1

Paper
Add Code

An Overview on Audio, Signal, Speech, & Language Processing for COVID-19

no code implementations • 18 May 2020 • Gauri Deshpande, Björn Schuller

Recently, there has been an increased attention towards innovating, enhancing, building, and deploying applications of speech signal processing for providing assistance and relief to human mankind from the Coronavirus (COVID-19) pandemic.

Computers and Society Sound Audio and Speech Processing

Paper
Add Code

The Multimodal Sentiment Analysis in Car Reviews (MuSe-CaR) Dataset: Collection, Insights and Improvements

no code implementations • 15 Jan 2021 • Lukas Stappen, Alice Baird, Lea Schumann, Björn Schuller

Truly real-life data presents a strong, but exciting challenge for sentiment and emotion research.

Multimodal Sentiment Analysis

Paper
Add Code

Fitbeat: COVID-19 Estimation based on Wristband Heart Rate

no code implementations • 19 Apr 2021 • Shuo Liu, Jing Han, Estela Laporta Puyal, Spyridon Kontaxis, Shaoxiong Sun, Patrick Locatelli, Judith Dineley, Florian B. Pokorny, Gloria Dalla Costa, Letizia Leocan, Ana Isabel Guerrero, Carlos Nos, Ana Zabalza, Per Soelberg Sørensen, Mathias Buron, Melinda Magyari, Yatharth Ranjan, Zulqarnain Rashid, Pauline Conde, Callum Stewart, Amos A Folarin, Richard JB Dobson, Raquel Bailón, Srinivasan Vairavan, NIcholas Cummins, Vaibhav A Narayan, Matthew Hotopf, Giancarlo Comi, Björn Schuller

This study investigates the potential of deep learning methods to identify individuals with suspected COVID-19 infection using remotely collected heart-rate data.

Specificity

Paper
Add Code

An Estimation of Online Video User Engagement from Features of Continuous Emotions

no code implementations • 4 May 2021 • Lukas Stappen, Alice Baird, Michelle Lienhart, Annalena Bätz, Björn Schuller

We investigate features extracted from these signals against various user engagement indicators including views, like/dislike ratio, as well as the sentiment of comments.

feature selection Time Series Analysis

Paper
Add Code

Uncertainty Aware Review Hallucination for Science Article Classification

no code implementations • Findings (ACL) 2021 • Korbinian Friedl, Georgios Rizos, Lukas Stappen, Madina Hasan, Lucia Specia, Thomas Hain, Björn Schuller

Classification Hallucination

Paper
Add Code

Proceedings of the ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts

no code implementations • 14 Jul 2022 • Alice Baird, Panagiotis Tzirakis, Gauthier Gidel, Marco Jiralerspong, Eilif B. Muller, Kory Mathewson, Björn Schuller, Erik Cambria, Dacher Keltner, Alan Cowen

The first, ExVo-MultiTask, requires participants to train a multi-task model to recognize expressed emotions and demographic traits from vocal bursts.

Few-Shot Learning

Paper
Add Code

A Comparative Cross Language View On Acted Databases Portraying Basic Emotions Utilising Machine Learning

no code implementations • LREC 2022 • Felix Burkhardt, Anabell Hacker, Uwe Reichel, Hagen Wierstorf, Florian Eyben, Björn Schuller

Since several decades emotional databases have been recorded by various laboratories.

Paper
Add Code

SyntAct: A Synthesized Database of Basic Emotions

no code implementations • DCLRL (LREC) 2022 • Felix Burkhardt, Florian Eyben, Björn Schuller

Speech emotion recognition is in the focus of research since several decades and has many applications.

Speech Emotion Recognition Speech Synthesis

Paper
Add Code

Proceedings of the ACII Affective Vocal Bursts Workshop and Competition 2022 (A-VB): Understanding a critically understudied modality of emotional expression

no code implementations • 27 Oct 2022 • Alice Baird, Panagiotis Tzirakis, Jeffrey A. Brooks, Christopher B. Gregory, Björn Schuller, Anton Batliner, Dacher Keltner, Alan Cowen

This is the Proceedings of the ACII Affective Vocal Bursts Workshop and Competition (A-VB).

Paper
Add Code

Enhancing Speech Emotion Recognition Through Differentiable Architecture Search

no code implementations • 23 May 2023 • Thejan Rajapakshe, Rajib Rana, Sara Khalifa, Berrak Sisman, Björn Schuller

In contrast to previous studies, we refrain from imposing constraints on the order of the layers for the CNN within the DARTS cell; instead, we allow DARTS to determine the optimal layer order autonomously.

Ranked #5 on Speech Emotion Recognition on IEMOCAP (UA metric)

Neural Architecture Search Speech Emotion Recognition

Paper
Add Code

Improving Speaker-independent Speech Emotion Recognition Using Dynamic Joint Distribution Adaptation

no code implementations • 18 Jan 2024 • Cheng Lu, Yuan Zong, Hailun Lian, Yan Zhao, Björn Schuller, Wenming Zheng

In speaker-independent speech emotion recognition, the training and testing samples are collected from diverse speakers, leading to a multi-domain shift challenge across the feature distributions of data from different speakers.

Domain Adaptation Speech Emotion Recognition

Paper
Add Code

Speech Swin-Transformer: Exploring a Hierarchical Transformer with Shifted Windows for Speech Emotion Recognition

no code implementations • 19 Jan 2024 • Yong Wang, Cheng Lu, Hailun Lian, Yan Zhao, Björn Schuller, Yuan Zong, Wenming Zheng

These segment-level patches are then encoded using a stack of Swin blocks, in which a local window Transformer is utilized to explore local inter-frame emotional information across frame patches of each segment patch.

Speech Emotion Recognition

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.