no code implementations • 14 Apr 2024 • Fanyi Wang, Peng Liu, Haotian Hu, Dan Meng, Jingwen Su, Jinjin Xu, Yanhao Zhang, Xiaoming Ren, Zhiwang Zhang
The proposed LoopAnimate, which for the first time extends the single-pass generation length of UNet-based video generation models to 35 frames while maintaining high-quality video generation.
no code implementations • 28 Feb 2023 • Xiaoming Ren, Chao Li, Shenjian Wang, Biao Li
Considering the bimodal nature of human speech perception, lips, and teeth movement has a pivotal role in automatic speech recognition.
2 code implementations • 24 Jul 2022 • Xiaoming Ren, Huifeng Zhu, Liuwei Wei, Minghui Wu, Jie Hao
In this work, we believe that the output information of each block in the encoder and decoder is not completely inclusive, in other words, their output information may be complementary.
Ranked #6 on Speech Recognition on AISHELL-1
Automatic Speech Recognition Automatic Speech Recognition (ASR) +3