1 code implementation • FNP (COLING) 2020 • Moreno La Quatra, Luca Cagliero
Quoted companies are requested to periodically publish financial reports in textual form.
1 code implementation • 26 May 2025 • Alkis Koudounas, Moreno La Quatra, Gabriele Ciravegna, Marco Fantini, Erika Crosetti, Giovanni Succo, Tania Cerquitelli, Sabato Marco Siniscalchi, Elena Baralis
Voice disorders significantly impact patient quality of life, yet non-invasive automated diagnosis remains under-explored due to both the scarcity of pathological voice data, and the variability in recording sources.
no code implementations • 26 May 2025 • Alkis Koudounas, Moreno La Quatra, Elena Baralis
Recent advances in conversational AI have demonstrated impressive capabilities in single-turn responses, yet multi-turn dialogues remain challenging for even the most sophisticated language models.
1 code implementation • 26 May 2025 • Alkis Koudounas, Moreno La Quatra, Eliana Pastor, Sabato Marco Siniscalchi, Elena Baralis
Kolmogorov-Arnold Networks (KANs) have recently emerged as a promising alternative to traditional neural architectures, yet their application to speech processing remains under explored.
1 code implementation • 26 May 2025 • Moreno La Quatra, Alkis Koudounas, Valerio Mario Salerno, Sabato Marco Siniscalchi
Despite the remarkable progress in end-to-end Automatic Speech Recognition (ASR) engines, accurately transcribing dysarthric speech remains a major challenge.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1
1 code implementation • 13 Mar 2025 • Moreno La Quatra, Juan Rafael Orozco-Arroyave, Marco Sabato Siniscalchi
One head is specialized for diadochokinetic patterns.
1 code implementation • 22 Feb 2025 • Alkis Koudounas, Moreno La Quatra, Marco Sabato Siniscalchi, Elena Baralis
In this work, we aim to overcome the above shortcoming and propose a novel foundation model, termed voc2vec, specifically designed for non-verbal human data leveraging exclusively open-source non-verbal audio datasets.
1 code implementation • 22 Jan 2025 • Moreno La Quatra, Valerio Mario Salerno, Yu Tsao, Sabato Marco Siniscalchi
In this paper, we present an encoder-decoder model leveraging Flan-T5 for post-Automatic Speech Recognition (ASR) Generative Speech Error Correction (GenSEC), and we refer to it as FlanEC.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+3
1 code implementation • 23 Jun 2024 • Moreno La Quatra, Maria Francesca Turco, Torbjørn Svendsen, Giampiero Salvi, Juan Rafael Orozco-Arroyave, Sabato Marco Siniscalchi
This work is concerned with devising a robust Parkinson's (PD) disease detector from speech in real-world operating conditions using (i) foundational models, and (ii) speech enhancement (SE) methods.
1 code implementation • 22 Jun 2024 • Moreno La Quatra, Alkis Koudounas, Elena Baralis, Sabato Marco Siniscalchi
We leverage self-supervised learning models to tackle this task and analyze differences and similarities between Italy's regional languages.
1 code implementation • 10 May 2024 • Rong Chao, Wen-Huang Cheng, Moreno La Quatra, Sabato Marco Siniscalchi, Chao-Han Huck Yang, Szu-Wei Fu, Yu Tsao
This work aims to study a scalable state-space model (SSM), Mamba, for the speech enhancement (SE) task.
Ranked #4 on
Speech Enhancement
on VoiceBank + DEMAND
1 code implementation • 2 May 2024 • Moreno La Quatra, Alkis Koudounas, Lorenzo Vaiani, Elena Baralis, Luca Cagliero, Paolo Garza, Sabato Marco Siniscalchi
Limited diversity in standardized benchmarks for evaluating audio representation learning (ARL) methods may hinder systematic comparison of current methods' capabilities.
1 code implementation • 14 Jun 2023 • Alkis Koudounas, Moreno La Quatra, Lorenzo Vaiani, Luca Colomba, Giuseppe Attanasio, Eliana Pastor, Luca Cagliero, Elena Baralis
Recent large-scale Spoken Language Understanding datasets focus predominantly on English and do not account for language-specific phenomena such as particular phonemes or words in different lects.