Search Results for author: Xiaoming Ren

Found 3 papers, 1 paper with code

LoopAnimate: Loopable Salient Object Animation

no code implementations • 14 Apr 2024 • Fanyi Wang, Peng Liu, Haotian Hu, Dan Meng, Jingwen Su, Jinjin Xu, Yanhao Zhang, Xiaoming Ren, Zhiwang Zhang

The proposed LoopAnimate for the first time extends the single-pass generation length of UNet-based video generation models to 35 frames while maintaining high-quality video generation.

Object Video Generation


Practice of the conformer enhanced AUDIO-VISUAL HUBERT on Mandarin and English

no code implementations • 28 Feb 2023 • Xiaoming Ren, Chao Li, Shenjian Wang, Biao Li

Considering the bimodal nature of human speech perception, lip and teeth movement plays a pivotal role in automatic speech recognition.

Automatic Speech Recognition


Improving Mandarin Speech Recognition with Block-augmented Transformer

2 code implementations • 24 Jul 2022 • Xiaoming Ren, Huifeng Zhu, Liuwei Wei, Minghui Wu, Jie Hao

In this work, we believe that the output information of each block in the encoder and decoder is not completely inclusive; in other words, the outputs of different blocks may be complementary.

Ranked #4 on Speech Recognition on AISHELL-1

Automatic Speech Recognition (ASR)

