1 code implementation • 10 Sep 2024 • Zhongweiyang Xu, Debottam Dutta, Yu-Lin Wei, Romit Roy Choudhury
Its goal is to use one single diffusion model to generate mutually-coherent music sources, that are then mixed to form the music.
no code implementations • 2 Oct 2023 • Muqiao Yang, Chunlei Zhang, Yong Xu, Zhongweiyang Xu, Heming Wang, Bhiksha Raj, Dong Yu
Speech enhancement aims to improve the quality of speech signals in terms of quality and intelligibility, and speech editing refers to the process of editing the speech according to specific user needs.
no code implementations • 16 Sep 2023 • Heming Wang, Meng Yu, Hao Zhang, Chunlei Zhang, Zhongweiyang Xu, Muqiao Yang, Yixuan Zhang, Dong Yu
Enhancing speech signal quality in adverse acoustic environments is a persistent challenge in speech processing.
no code implementations • 9 Jul 2022 • Zhongweiyang Xu, Xulin Fan, Mark Hasegawa-Johnson
Most current research upsamples the visual features along the time dimension so that audio and video features are able to align in time.
no code implementations • 9 Jul 2022 • Zhongweiyang Xu, Romit Roy Choudhury
We consider the problem of audio voice separation for binaural applications, such as earphones and hearing aids.