Search Results for author: Xiaoming Ren

Found 3 papers, 1 papers with code

LoopAnimate: Loopable Salient Object Animation

no code implementations14 Apr 2024 Fanyi Wang, Peng Liu, Haotian Hu, Dan Meng, Jingwen Su, Jinjin Xu, Yanhao Zhang, Xiaoming Ren, Zhiwang Zhang

The proposed LoopAnimate, which for the first time extends the single-pass generation length of UNet-based video generation models to 35 frames while maintaining high-quality video generation.

Object Video Generation

Practice of the conformer enhanced AUDIO-VISUAL HUBERT on Mandarin and English

no code implementations28 Feb 2023 Xiaoming Ren, Chao Li, Shenjian Wang, Biao Li

Considering the bimodal nature of human speech perception, lips, and teeth movement has a pivotal role in automatic speech recognition.

Automatic Speech Recognition speech-recognition +1

Improving Mandarin Speech Recogntion with Block-augmented Transformer

2 code implementations24 Jul 2022 Xiaoming Ren, Huifeng Zhu, Liuwei Wei, Minghui Wu, Jie Hao

In this work, we believe that the output information of each block in the encoder and decoder is not completely inclusive, in other words, their output information may be complementary.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Cannot find the paper you are looking for? You can Submit a new open access paper.