1 code implementation • 13 Mar 2024 • Heitor R. Guimarães, Arthur Pimentel, Anderson R. Avila, Mehdi Rezagholizadeh, Boxing Chen, Tiago H. Falk
Lastly, we show that the proposed recipe can be applied to other distillation methodologies, such as the recent DPWavLM.
no code implementations • 25 Sep 2023 • Arthur Pimentel, Heitor Guimarães, Anderson R. Avila, Mehdi Rezagholizadeh, Tiago H. Falk
Recent advances with self-supervised learning have allowed speech recognition systems to achieve state-of-the-art (SOTA) word error rates (WER) while requiring only a fraction of the labeled training data needed by its predecessors.
no code implementations • 12 Jun 2023 • Anderson R. Avila, Mehdi Rezagholizadeh, Chao Xing
In this work, we investigate impacts of this ASR error propagation on state-of-the-art NLU systems based on pre-trained language models (PLM), such as BERT and RoBERTa.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 18 Feb 2023 • Heitor R. Guimarães, Arthur Pimentel, Anderson R. Avila, Mehdi Rezagholizadeh, Boxing Chen, Tiago H. Falk
The proposed layer-wise distillation recipe is evaluated on top of three well-established universal representations, as well as with three downstream tasks.
no code implementations • 12 Nov 2022 • Heitor R. Guimarães, Arthur Pimentel, Anderson R. Avila, Mehdi Rezagholizadeh, Tiago H. Falk
Self-supervised speech representation learning aims to extract meaningful factors from the speech signal that can later be used across different downstream tasks, such as speech and/or emotion recognition.
no code implementations • 15 Jul 2022 • Anderson R. Avila, Khalil Bibi, Rui Heng Yang, Xinlin Li, Chao Xing, Xiao Chen
Deep neural networks (DNN) have achieved impressive success in multiple domains.
no code implementations • 8 Jun 2021 • Yiran Cao, Nihal Potdar, Anderson R. Avila
Such approaches allow for the extraction of semantic information directly from the speech signal, thus bypassing the need for a transcript from an automatic speech recognition (ASR) system.
Ranked #11 on Spoken Language Understanding on Fluent Speech Commands (using extra training data)
Automatic Speech Recognition Automatic Speech Recognition (ASR) +4
no code implementations • 20 May 2021 • Nihal Potdar, Anderson R. Avila, Chao Xing, Dong Wang, Yiran Cao, Xiao Chen
In this paper, we propose a streaming end-to-end framework that can process multiple intentions in an online and incremental way.