Search Results for author: Yicheng Gu

Found 2 papers, 2 papers with code

Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation

1 code implementation · 7 Jul 2024 · Haorui He, Zengqiang Shang, Chaoren Wang, Xuyuan Li, Yicheng Gu, Hua Hua, Liwei Liu, Chen Yang, Jiaqi Li, Peiyang Shi, Yuancheng Wang, Kai Chen, Pengyuan Zhang, Zhizheng Wu

To facilitate the scale-up of Emilia, we also present Emilia-Pipe, the first open-source preprocessing pipeline designed to efficiently transform raw, in-the-wild speech data into high-quality training data with speech annotations.

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds

1 code implementation · 1 Jul 2024 · Yiming Zhang, Yicheng Gu, Yanhong Zeng, Zhening Xing, Yuancheng Wang, Zhizheng Wu, Kai Chen

Meanwhile, the temporal controller incorporates an onset detector and a timestamp-based adapter to achieve precise audio-video alignment.

Audio Generation, Video Alignment
