no code implementations • 24 Nov 2023 • Mingze Wang, Zeping Min, Lei Wu
Inspired by this analysis, we propose a novel algorithm called Progressive Rescaling Gradient Descent (PRGD) and show that PRGD can maximize the margin at an {\em exponential rate}.
no code implementations • 13 Jul 2023 • Zeping Min, Jinbo Wang
This paper explores the integration of Large Language Models (LLMs) into Automatic Speech Recognition (ASR) systems to improve transcription accuracy.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
no code implementations • 5 Feb 2023 • Zeping Min, Qian Ge, Zhong Li, Weinan E
Furthermore, in the ASR task, MAC beats wav2vec2 (with fine-tuning) on common voice datasets of Cantonese and gets really competitive results on common voice datasets of Taiwanese and Japanese.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 1 Feb 2023 • Zeping Min
Transformers have achieved great success in machine translation, but transformer-based NMT models often require millions of bilingual parallel corpus for training.
no code implementations • 18 Nov 2022 • Zeping Min, Qian Ge, Cheng Tai
The core idea of the pseudo label based semi-supervised learning algorithm is to use the model trained on the labeled data to generate pseudo labels on the unlabeled data, and then train a model to fit the previously generated pseudo labels.
no code implementations • 27 Oct 2022 • Zeping Min, Qian Ge, Guanhua Huang
In this paper, we propose a novel Siamese Adversarial Network (SAN) architecture for automatic speech recognition, which aims at solving the difficulty of fuzzy audio recognition.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2