no code implementations • 26 May 2025 • Alkis Koudounas, Moreno La Quatra, Elena Baralis
Recent advances in conversational AI have demonstrated impressive capabilities in single-turn responses, yet multi-turn dialogues remain challenging for even the most sophisticated language models.
1 code implementation • 26 May 2025 • Moreno La Quatra, Alkis Koudounas, Valerio Mario Salerno, Sabato Marco Siniscalchi
Despite the remarkable progress in end-to-end Automatic Speech Recognition (ASR) engines, accurately transcribing dysarthric speech remains a major challenge.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
1 code implementation • 26 May 2025 • Alkis Koudounas, Moreno La Quatra, Eliana Pastor, Sabato Marco Siniscalchi, Elena Baralis
Kolmogorov-Arnold Networks (KANs) have recently emerged as a promising alternative to traditional neural architectures, yet their application to speech processing remains under explored.
1 code implementation • 26 May 2025 • Alkis Koudounas, Moreno La Quatra, Gabriele Ciravegna, Marco Fantini, Erika Crosetti, Giovanni Succo, Tania Cerquitelli, Sabato Marco Siniscalchi, Elena Baralis
Voice disorders significantly impact patient quality of life, yet non-invasive automated diagnosis remains under-explored due to both the scarcity of pathological voice data, and the variability in recording sources.
1 code implementation • 21 May 2025 • Alkis Koudounas, Claudio Savelli, Flavio Giobergia, Elena Baralis
Machine unlearning, the process of efficiently removing specific information from machine learning models, is a growing area of interest for responsible AI.
1 code implementation • 22 Feb 2025 • Alkis Koudounas, Moreno La Quatra, Marco Sabato Siniscalchi, Elena Baralis
In this work, we aim to overcome the above shortcoming and propose a novel foundation model, termed voc2vec, specifically designed for non-verbal human data leveraging exclusively open-source non-verbal audio datasets.
1 code implementation • 22 Jun 2024 • Moreno La Quatra, Alkis Koudounas, Elena Baralis, Sabato Marco Siniscalchi
We leverage self-supervised learning models to tackle this task and analyze differences and similarities between Italy's regional languages.
1 code implementation • 20 Jun 2024 • Alkis Koudounas, Gabriele Ciravegna, Marco Fantini, Giovanni Succo, Erika Crosetti, Tania Cerquitelli, Elena Baralis
Voice disorders are pathologies significantly affecting patient quality of life.
1 code implementation • 20 Jun 2024 • Alkis Koudounas, Flavio Giobergia, Eliana Pastor, Elena Baralis
Speech models may be affected by performance imbalance in different population subgroups, raising concerns about fair treatment across these groups.
1 code implementation • 2 May 2024 • Moreno La Quatra, Alkis Koudounas, Lorenzo Vaiani, Elena Baralis, Luca Cagliero, Paolo Garza, Sabato Marco Siniscalchi
Limited diversity in standardized benchmarks for evaluating audio representation learning (ARL) methods may hinder systematic comparison of current methods' capabilities.
no code implementations • 31 Mar 2024 • Alkis Koudounas, Flavio Giobergia
We identify subgroups of audio recordings based on combinations of these metadata and compute each subgroup's performance (e. g., Word Error Rate) and the difference in performance (''divergence'') w. r. t the overall population.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
no code implementations • 1 Mar 2024 • Federico Borra, Claudio Savelli, Giacomo Rosso, Alkis Koudounas, Flavio Giobergia
In Natural Language Generation (NLG), contemporary Large Language Models (LLMs) face several challenges, such as generating fluent yet inaccurate outputs and reliance on fluency-centric metrics.
no code implementations • 2 Oct 2023 • Flavio Giobergia, Alkis Koudounas, Elena Baralis
Exploring exoplanets has transformed our understanding of the universe by revealing many planetary systems that defy our current understanding.
1 code implementation • 14 Sep 2023 • Eliana Pastor, Alkis Koudounas, Giuseppe Attanasio, Dirk Hovy, Elena Baralis
Existing work focuses on a few spoken language understanding (SLU) tasks, and explanations are difficult to interpret for most users.
1 code implementation • 14 Jun 2023 • Alkis Koudounas, Moreno La Quatra, Lorenzo Vaiani, Luca Colomba, Giuseppe Attanasio, Eliana Pastor, Luca Cagliero, Elena Baralis
Recent large-scale Spoken Language Understanding datasets focus predominantly on English and do not account for language-specific phenomena such as particular phonemes or words in different lects.