1 code implementation • 7 Feb 2022 • Bethan Thomas, Samuel Kessler, Salah Karout
In this paper we propose applying adapters to wav2vec 2. 0 to reduce the number of parameters required for downstream ASR tasks, and increase scalability of the model to multiple tasks or languages.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2
no code implementations • 22 Dec 2021 • Duo Wang, Salah Karout
Multi-Modal Self-Supervised Learning from videos has been shown to improve model's performance on various downstream tasks.
no code implementations • 26 Jul 2021 • Samuel Kessler, Bethan Thomas, Salah Karout
We evaluate by applying these language representations to automatic speech recognition.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +5
no code implementations • 14 Sep 2020 • Dario Fuoli, Zhiwu Huang, Shuhang Gu, Radu Timofte, Arnau Raventos, Aryan Esfandiari, Salah Karout, Xuan Xu, Xin Li, Xin Xiong, Jinge Wang, Pablo Navarrete Michelini, Wen-Hao Zhang, Dongyang Zhang, Hanwei Zhu, Dan Xia, Haoyu Chen, Jinjin Gu, Zhi Zhang, Tongtong Zhao, Shanshan Zhao, Kazutoshi Akita, Norimichi Ukita, Hrishikesh P. S, Densen Puthussery, Jiji C. V
Missing information can be restored well in this region, especially in HR videos, where the high-frequency content mostly consists of texture details.