Search Results for author: Moreno La Quatra

Found 13 papers, 12 papers with code

End-to-end Training For Financial Report Summarization

1 code implementation FNP (COLING) 2020 Moreno La Quatra, Luca Cagliero

Quoted companies are requested to periodically publish financial reports in textual form.

MVP: Multi-source Voice Pathology detection

1 code implementation26 May 2025 Alkis Koudounas, Moreno La Quatra, Gabriele Ciravegna, Marco Fantini, Erika Crosetti, Giovanni Succo, Tania Cerquitelli, Sabato Marco Siniscalchi, Elena Baralis

Voice disorders significantly impact patient quality of life, yet non-invasive automated diagnosis remains under-explored due to both the scarcity of pathological voice data, and the variability in recording sources.

Sentence Voice pathology detection

DeepDialogue: A Multi-Turn Emotionally-Rich Spoken Dialogue Dataset

no code implementations26 May 2025 Alkis Koudounas, Moreno La Quatra, Elena Baralis

Recent advances in conversational AI have demonstrated impressive capabilities in single-turn responses, yet multi-turn dialogues remain challenging for even the most sophisticated language models.

Philosophy

"KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding

1 code implementation26 May 2025 Alkis Koudounas, Moreno La Quatra, Eliana Pastor, Sabato Marco Siniscalchi, Elena Baralis

Kolmogorov-Arnold Networks (KANs) have recently emerged as a promising alternative to traditional neural architectures, yet their application to speech processing remains under explored.

Kolmogorov-Arnold Networks Spoken Language Understanding

Exploring Generative Error Correction for Dysarthric Speech Recognition

1 code implementation26 May 2025 Moreno La Quatra, Alkis Koudounas, Valerio Mario Salerno, Sabato Marco Siniscalchi

Despite the remarkable progress in end-to-end Automatic Speech Recognition (ASR) engines, accurately transcribing dysarthric speech remains a major challenge.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

voc2vec: A Foundation Model for Non-Verbal Vocalization

1 code implementation22 Feb 2025 Alkis Koudounas, Moreno La Quatra, Marco Sabato Siniscalchi, Elena Baralis

In this work, we aim to overcome the above shortcoming and propose a novel foundation model, termed voc2vec, specifically designed for non-verbal human data leveraging exclusively open-source non-verbal audio datasets.

model

FlanEC: Exploring Flan-T5 for Post-ASR Error Correction

1 code implementation22 Jan 2025 Moreno La Quatra, Valerio Mario Salerno, Yu Tsao, Sabato Marco Siniscalchi

In this paper, we present an encoder-decoder model leveraging Flan-T5 for post-Automatic Speech Recognition (ASR) Generative Speech Error Correction (GenSEC), and we refer to it as FlanEC.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions

1 code implementation23 Jun 2024 Moreno La Quatra, Maria Francesca Turco, Torbjørn Svendsen, Giampiero Salvi, Juan Rafael Orozco-Arroyave, Sabato Marco Siniscalchi

This work is concerned with devising a robust Parkinson's (PD) disease detector from speech in real-world operating conditions using (i) foundational models, and (ii) speech enhancement (SE) methods.

Parkinson Detection from Speech Speech Enhancement

Speech Analysis of Language Varieties in Italy

1 code implementation22 Jun 2024 Moreno La Quatra, Alkis Koudounas, Elena Baralis, Sabato Marco Siniscalchi

We leverage self-supervised learning models to tackle this task and analyze differences and similarities between Italy's regional languages.

Contrastive Learning Self-Supervised Learning

Benchmarking Representations for Speech, Music, and Acoustic Events

1 code implementation2 May 2024 Moreno La Quatra, Alkis Koudounas, Lorenzo Vaiani, Elena Baralis, Luca Cagliero, Paolo Garza, Sabato Marco Siniscalchi

Limited diversity in standardized benchmarks for evaluating audio representation learning (ARL) methods may hinder systematic comparison of current methods' capabilities.

Audio Classification Benchmarking +2

ITALIC: An Italian Intent Classification Dataset

1 code implementation14 Jun 2023 Alkis Koudounas, Moreno La Quatra, Lorenzo Vaiani, Luca Colomba, Giuseppe Attanasio, Eliana Pastor, Luca Cagliero, Elena Baralis

Recent large-scale Spoken Language Understanding datasets focus predominantly on English and do not account for language-specific phenomena such as particular phonemes or words in different lects.

Classification intent-classification +4

Cannot find the paper you are looking for? You can Submit a new open access paper.