1 code implementation • 4 Jan 2023 • Maxime Burchi, Radu Timofte
We improve previous lip reading methods using an Efficient Conformer back-end on top of a ResNet-18 visual front-end and by adding intermediate CTC losses between blocks.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3
2 code implementations • 22 Sep 2022 • Marcos V. Conde, Ui-Jin Choi, Maxime Burchi, Radu Timofte
Using this method we can tackle the major issues in training transformer vision models, such as training instability, resolution gaps between pre-training and fine-tuning, and hunger on data.
1 code implementation • 27 Apr 2022 • Marcos V. Conde, Maxime Burchi, Radu Timofte
Learning-based approaches for perceptual image quality assessment (IQA) usually require both the distorted and reference image for measuring the perceptual quality accurately.
1 code implementation • 31 Aug 2021 • Maxime Burchi, Valentin Vielzeuf
The recently proposed Conformer architecture has shown state-of-the-art performances in Automatic Speech Recognition by combining convolution with attention to model both local and global dependencies.
Automatic Speech Recognition Automatic Speech Recognition (ASR) +2