Search Results for author: Athanasios Katsamanis

Found 10 papers, 2 papers with code

Efficient Audio Captioning Transformer with Patchout and Text Guidance

no code implementations6 Apr 2023 Thodoris Kouzelis, Grigoris Bastas, Athanasios Katsamanis, Alexandros Potamianos

The results show that the proposed techniques improve the performance of our system and while reducing the computational complexity.

Audio captioning Caption Generation +3

Visual Speech-Aware Perceptual 3D Facial Expression Reconstruction from Videos

1 code implementation22 Jul 2022 Panagiotis P. Filntisis, George Retsinas, Foivos Paraperas-Papantoniou, Athanasios Katsamanis, Anastasios Roussos, Petros Maragos

The recent state of the art on monocular 3D face reconstruction from image data has made some impressive advancements, thanks to the advent of Deep Learning.

3D Face Reconstruction 3D Reconstruction

Regotron: Regularizing the Tacotron2 architecture via monotonic alignment loss

no code implementations28 Apr 2022 Efthymios Georgiou, Kosmas Kritsis, Georgios Paraskevopoulos, Athanasios Katsamanis, Vassilis Katsouros, Alexandros Potamianos

Recent deep learning Text-to-Speech (TTS) systems have achieved impressive performance by generating speech close to human parity.

Zero-Shot Cross-lingual Aphasia Detection using Automatic Speech Recognition

no code implementations1 Apr 2022 Gerasimos Chatzoudis, Manos Plitsis, Spyridoula Stamouli, Athanasia-Lida Dimou, Athanasios Katsamanis, Vassilis Katsouros

Like in many medical applications, aphasic speech data is scarce and the problem is exacerbated in so-called "low resource" languages, which are, for this task, most languages excluding English.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

EmpBot: A T5-based Empathetic Chatbot focusing on Sentiments

no code implementations30 Oct 2021 Emmanouil Zaranis, Georgios Paraskevopoulos, Athanasios Katsamanis, Alexandros Potamianos

Specifically, during finetuning we propose to use three objectives: response language modeling, sentiment understanding, and empathy forcing.

Chatbot Language Modelling

AudioVisual Speech Synthesis: A brief literature review

no code implementations18 Feb 2021 Efthymios Georgiou, Athanasios Katsamanis

This brief literature review studies the problem of audiovisual speech synthesis, which is the problem of generating an animated talking head given a text as input.

Speech Synthesis

The Twins Corpus of Museum Visitor Questions

no code implementations LREC 2012 Priti Aggarwal, Ron artstein, Jillian Gerten, Athanasios Katsamanis, Shrikanth Narayanan, Angela Nazarian, David Traum

In addition to speech recordings, the corpus contains the outputs of speech recognition performed at the time of utterance as well as the system interpretation of the utterances.

Dialogue Management Natural Language Understanding +3

Cannot find the paper you are looking for? You can Submit a new open access paper.