no code implementations • TRITON 2021 • Susana Rodriguez, Roberto Gretter, Marco Matassoni, Alvaro Alonso, Oscar Corcho, Mariano Rico, Falavigna Daniele
We present a system to support simultaneous interpreting in specific domains.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
no code implementations • NAACL (BEA) 2022 • Stefano Bannò, Marco Matassoni
The growing demand for learning English as a second language has led to an increasing interest in automatic approaches for assessing spoken language proficiency.
no code implementations • 16 Nov 2022 • Stefano Bannò, Kate M. Knill, Marco Matassoni, Vyas Raina, Mark J. F. Gales
Though the wav2vec 2. 0 based system is found to be sensitive to the nature of the response, it can be configured to yield comparable performance to systems requiring a speech transcription, and yields gains when appropriately combined with standard approaches.
no code implementations • 24 Oct 2022 • Stefano Bannò, Marco Matassoni
The increasing demand for learning English as a second language has led to a growing interest in methods for automatically assessing spoken language proficiency.
no code implementations • MTSummit 2021 • Roberto Gretter, Marco Matassoni, Daniele Falavigna
We address the problem of language model customization in applications where the ASR component needs to manage domain-specific terminology; although current state-of-the-art speech recognition technology provides excellent results for generic domains, the adaptation to specialized dictionaries or glossaries is still an open issue.
no code implementations • 23 Jun 2021 • Sara Papi, Edmondo Trentin, Roberto Gretter, Marco Matassoni, Daniele Falavigna
The paper copes with the task of automatic assessment of second language proficiency from the language learners' spoken responses to test prompts.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 13 Apr 2021 • Nina Hosseini-Kivanani, Roberto Gretter, Marco Matassoni, Giuseppe Daniele Falavigna
In this study, we develop a mispronunciation assessment system that checks the pronunciation of non-native English speakers, identifies the commonly mispronounced phonemes of Italian learners of English, and presents an evaluation of the non-native pronunciation observed in phonetically annotated speech corpora.
1 code implementation • 6 Apr 2021 • Samuele Cornell, Alessio Brutti, Marco Matassoni, Stefano Squartini
Fully exploiting ad-hoc microphone networks for distant speech recognition is still an open issue.
no code implementations • LREC 2020 • Ornella Mich, Nadia Mana, Roberto Gretter, Marco Matassoni, Daniele Falavigna
Our system, based on ASR technology, implements the Cornoldi{'}s MT battery, which is a well-known Italian test to assess reading skills.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • LREC 2020 • Roberto Gretter, Marco Matassoni, Stefano Bannò, Daniele Falavigna
This paper describes "TLT-school" a corpus of speech utterances collected in schools of northern Italy for assessing the performance of students learning both English and German.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +1
no code implementations • 25 Sep 2018 • Marco Matassoni, Roberto Gretter, Daniele Falavigna, Diego Giuliani
This work deals with non-native children's speech and investigates both multi-task and transfer learning approaches to adapt a multi-language Deep Neural Network (DNN) to speakers, specifically children, learning a foreign language.
no code implementations • 22 Jun 2017 • Shahab Jalalvand, Matteo Negri, Daniele Falavigna, Marco Matassoni, Marco Turchi
In this paper we propose a novel variant of ROVER that takes advantage of ASR quality estimation (QE) for ranking the transcriptions at "segment level" instead of: i) relying on confidence scores, or ii) feeding ROVER with randomly ordered hypotheses.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 6 Feb 2017 • Daniele Falavigna, Marco Matassoni, Shahab Jalalvand, Matteo Negri, Marco Turchi
Our hypothesis is that significant improvements can be achieved by: i)automatically transcribing the evaluation data we are currently trying to recognise, and ii) selecting from it a subset of "good quality" instances based on the word error rate (WER) scores predicted by a QE component.