Search Results for author: Wenjiang Zhou

Found 2 papers, 1 papers with code

MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting

1 code implementation14 Oct 2024 Yue Zhang, Minhao Liu, Zhaokang Chen, Bin Wu, Yubin Zeng, Chao Zhan, Yingjie He, Junxin Huang, Wenjiang Zhou

We propose MuseTalk, which generates lip-sync targets in a latent space encoded by a Variational Autoencoder, enabling high-fidelity talking face video generation with efficient inference.

Video Generation

MBTFNet: Multi-Band Temporal-Frequency Neural Network For Singing Voice Enhancement

no code implementations6 Oct 2023 Weiming Xu, Zhouxuan Chen, Zhili Tan, Shubo Lv, Runduo Han, Wenjiang Zhou, Weifeng Zhao, Lei Xie

A typical neural speech enhancement (SE) approach mainly handles speech and noise mixtures, which is not optimal for singing voice enhancement scenarios.

Music Source Separation Speech Enhancement

Cannot find the paper you are looking for? You can Submit a new open access paper.