Search Results for author: Athanasios Katsamanis

The recent state of the art on monocular 3D face reconstruction from image data has made some impressive advancements, thanks to the advent of Deep Learning.

3D Face Reconstruction 3D Reconstruction

203

Paper
Code

Regotron: Regularizing the Tacotron2 architecture via monotonic alignment loss

no code implementations • 28 Apr 2022 • Efthymios Georgiou, Kosmas Kritsis, Georgios Paraskevopoulos, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos

Recent deep learning Text-to-Speech (TTS) systems have achieved impressive performance by generating speech close to human parity.

Paper
Add Code

Zero-Shot Cross-lingual Aphasia Detection using Automatic Speech Recognition

no code implementations • 1 Apr 2022 • Gerasimos Chatzoudis, Manos Plitsis, Spyridoula Stamouli, Athanasia-Lida Dimou, Athanasios Katsamanis, Vassilis Katsouros

Like in many medical applications, aphasic speech data is scarce and the problem is exacerbated in so-called "low resource" languages, which are, for this task, most languages excluding English.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

EmpBot: A T5-based Empathetic Chatbot focusing on Sentiments

no code implementations • 30 Oct 2021 • Emmanouil Zaranis, Georgios Paraskevopoulos, Athanasios Katsamanis, Alexandros Potamianos

Specifically, during finetuning we propose to use three objectives: response language modeling, sentiment understanding, and empathy forcing.

Chatbot Language Modelling

Paper
Add Code

AudioVisual Speech Synthesis: A brief literature review

no code implementations • 18 Feb 2021 • Efthymios Georgiou, Athanasios Katsamanis

This brief literature review studies the problem of audiovisual speech synthesis, which is the problem of generating an animated talking head given a text as input.

Speech Synthesis

Paper
Add Code

The Twins Corpus of Museum Visitor Questions

no code implementations • LREC 2012 • Priti Aggarwal, Ron artstein, Jillian Gerten, Athanasios Katsamanis, Shrikanth Narayanan, Angela Nazarian, David Traum

In addition to speech recordings, the corpus contains the outputs of speech recognition performed at the time of utterance as well as the system interpretation of the utterances.

Dialogue Management Natural Language Understanding +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.