no code implementations • 29 Apr 2025 • Yaroslav Getman, Tamás Grósz, Mikko Kurimo, Giampiero Salvi
This paper presents the "Non-native Children's Automatic Speech Assessment" (NOCASA) - a data competition part of the IEEE MLSP 2025 conference.
1 code implementation • 23 Jun 2024 • Moreno La Quatra, Maria Francesca Turco, Torbjørn Svendsen, Giampiero Salvi, Juan Rafael Orozco-Arroyave, Sabato Marco Siniscalchi
This work is concerned with devising a robust Parkinson's (PD) disease detector from speech in real-world operating conditions using (i) foundational models, and (ii) speech enhancement (SE) methods.
no code implementations • 25 Apr 2024 • Giampiero Salvi
This is done employing hidden Markov models and using the SpeechDat database to train their parameters.
no code implementations • 12 Jan 2024 • Giampiero Salvi
This paper describes the use of connectionist techniques in phonetic speech recognition with strong latency constraints.
no code implementations • 11 Jan 2024 • Giampiero Salvi
The advantage of this measure is its simplicity as the posterior probabilities of each class are available in connectionist phoneme recognition.
1 code implementation • Interspeech 2023 • Janine Rugayan, Giampiero Salvi, Torbjørn Svendsen
Second, we demonstrate that ASD is more effective than WER as an indicator of performance on downstream NLP tasks such as named entity recognition and sentiment classification.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+5
no code implementations • 13 Jul 2023 • Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi
We address the video prediction task by putting forth a novel model that combines (i) a novel hierarchical residual learning vector quantized variational autoencoder (HR-VQVAE), and (ii) a novel autoregressive spatiotemporal predictive model (AST-PM).
1 code implementation • Interspeech 2022 • Janine Rugayan, Torbjørn Svendsen, Giampiero Salvi
In addition, we present results using Semantic Distance (SemDist), and compare them with ASD.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+5
1 code implementation • 9 Aug 2022 • Mohammad Adiban, Kalin Stefanov, Sabato Marco Siniscalchi, Giampiero Salvi
We propose a multi-layer variational autoencoder method, we call HR-VQVAE, that learns hierarchical discrete representations of the data.
1 code implementation • 11 Jun 2021 • Jerome Abdelnour, Jean Rouat, Giampiero Salvi
We also test the addition of a MALiMo module in our model on both CLEAR2 and DAQA.
no code implementations • 11 Sep 2020 • Mohammad Adiban, Arash Safari, Giampiero Salvi
In this study, we introduce a novel unsupervised countermeasure for smart grid power systems, based on generative adversarial networks (GANs).
no code implementations • 28 Feb 2019 • Jerome Abdelnour, Giampiero Salvi, Jean Rouat
The AQA task consists of analyzing an acoustic scene composed by a combination of elementary sounds and answering questions that relate the position and properties of these sounds.
1 code implementation • 26 Feb 2019 • Giovanni Saponaro, Lorenzo Jamone, Alexandre Bernardino, Giampiero Salvi
It then uses this information to learn a mapping between its own actions and those performed by a human in a shared environment.
1 code implementation • 26 Nov 2018 • Jerome Abdelnour, Giampiero Salvi, Jean Rouat
We introduce the task of acoustic question answering (AQA) in the area of acoustic reasoning.
1 code implementation • 8 Apr 2018 • Cheng Zhang, Cengiz Öztireli, Stephan Mandt, Giampiero Salvi
We first show that the phenomenon of variance reduction by diversified sampling generalizes in particular to non-stationary point processes.
1 code implementation • 27 Nov 2017 • Giampiero Salvi, Luis Montesano, Alexandre Bernardino, José Santos-Victor
The model is based on an affordance network, i. e., a mapping between robot actions, robot perceptions, and the perceived effects of these actions upon objects.
no code implementations • 24 Nov 2017 • Giovanni Saponaro, Lorenzo Jamone, Alexandre Bernardino, Giampiero Salvi
A growing field in robotics and Artificial Intelligence (AI) research is human-robot collaboration, whose target is to enable effective teamwork between humans and robots.
no code implementations • 24 Nov 2017 • Kalin Stefanov, Jonas Beskow, Giampiero Salvi
Active speaker detection is a fundamental prerequisite for any artificial cognitive system attempting to acquire language in social settings.
no code implementations • 3 Oct 2016 • Akash Kumar Dhaka, Giampiero Salvi
We propose the application of a semi-supervised learning method to improve the performance of acoustic modelling for automatic speech recognition based on deep neural net- works.
no code implementations • 29 Jun 2016 • Akash Kumar Dhaka, Giampiero Salvi
We present a systematic analysis on the performance of a phonetic recogniser when the window of input features is not symmetric with respect to the current frame.
no code implementations • LREC 2014 • Niklas Vanhainen, Giampiero Salvi
This paper presents results for large vocabulary continuous speech recognition (LVCSR) in Swedish.
no code implementations • LREC 2014 • Giampiero Salvi, Niklas Vanhainen
This paper presents a plugin that adds automatic speech recognition (ASR) functionality to the WaveSurfer sound manipulation and visualisation program.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+1