1 code implementation • 28 May 2024 • Panagiotis Koromilas, Giorgos Bouritsas, Theodoros Giannakopoulos, Mihalis Nicolaou, Yannis Panagakis
DHEL simplifies the problem by decoupling the target hyperspherical energy from the alignment of positive examples while preserving the same theoretical guarantees.
no code implementations • 3 Apr 2023 • Nikolaos Antoniou, Athanasios Katsamanis, Theodoros Giannakopoulos, Shrikanth Narayanan
There is an imminent need for guidelines and standard test sets to allow direct and fair comparisons of speech emotion recognition (SER).
1 code implementation • 21 Nov 2022 • Charilaos Papaioannou, Ioannis Valiantzas, Theodoros Giannakopoulos, Maximos Kaliakatsos-Papakostas, Alexandros Potamianos
The content has been collected from a Greek documentary series that is available online, where academics present music traditions of Greece with live music and dance performance during the show, along with discussions about social, cultural and musicological aspects of the presented music.
1 code implementation • LREC 2022 • Maria Moutti, Sofia Eleftheriou, Panagiotis Koromilas, Theodoros Giannakopoulos
Apart from a typical speech-to-text transcription with Automatic Speech Recognition (ASR), Speech Emotion Recognition (SER) can be used to automatically predict the underlying emotional content of speech dialogues in theatrical plays, and thus to provide a deeper understanding how the actors utter their lines.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 6 Oct 2021 • Panagiotis Koromilas, Theodoros Giannakopoulos
Multimodal Language Analysis is a demanding area of research, since it is associated with two requirements: combining different modalities and capturing temporal information.
Ranked #10 on Multimodal Sentiment Analysis on CMU-MOSEI (using extra training data)
1 code implementation • Pattern Recognition in Multimedia Signal Analysis 2021 • Theodoros Psallidas, Panagiotis Koromilas, Theodoros Giannakopoulos, Evaggelos Spyrou
In this work, we present an approach that uses both aural and visual features in order to create video summaries from user-generated videos.
no code implementations • 14 Apr 2021 • Georgios Paraskevopoulos, Efthymios Tzinis, Nikolaos Ellinas, Theodoros Giannakopoulos, Alexandros Potamianos
We examine the use of linear and non-linear dimensionality reduction algorithms for extracting low-rank feature representations for speech emotion recognition.
no code implementations • 28 Nov 2013 • Sergios Petridis, Theodoros Giannakopoulos, Costantine D. Spyropoulos
Finally, the pupil center and radius is estimated by optimal filtering within the area of the iris.