no code implementations • 25 Jun 2024 • Yongliang Wu, Bozheng Li, Jiawang Cao, Wenbo Zhu, Yi Lu, Weiheng Chi, Chuyun Xie, Haolin Zheng, Ziyue Su, Jay Wu, Xu Yang
The Long-form Video Question-Answering task requires the comprehension and analysis of extended video content to respond accurately to questions by utilizing both temporal and contextual information.
no code implementations • 24 May 2024 • Yongliang Wu, Shiji Zhou, Mingzhuo Yang, Lianzhe Wang, Wenbo Zhu, Heng Chang, Xiao Zhou, Xu Yang
Current text-to-image diffusion models have achieved groundbreaking results in image generation tasks.
no code implementations • 10 Mar 2024 • Jiawang Cao, Yongliang Wu, Weiheng Chi, Wenbo Zhu, Ziyue Su, Jay Wu
The proliferation of mobile devices and social media has revolutionized content dissemination, with short-form video becoming increasingly prevalent.
no code implementations • 8 Nov 2023 • Wenbo Zhu, Tiechuan Hu
In this paper, we look at a database of tweets sorted by various keywords that could indicate the users sentiment towards covid vaccines.
no code implementations • 28 Jun 2023 • Aoqi Guo, Junnan Wu, Peng Gao, Wenbo Zhu, Qinwen Guo, Dazhi Gao, Yujun Wang
In this paper, we propose a target speech extraction network that utilizes spatial information to enhance the performance of neural beamformer.
1 code implementation • 28 Mar 2021 • Shanzheng Guan, Shupei Liu, Junqi Chen, Wenbo Zhu, Shengqiang Li, Xu Tan, Ziye Yang, Menglong Xu, Yijiang Chen, Jianyu Wang, Xiao-Lei Zhang
We trained several multi-device speech recognition systems on both the Libri-adhoc40 dataset and a simulated dataset.
no code implementations • 29 Nov 2020 • Wenbo Zhu, Mou Wang, Xiao-Lei Zhang, Susanto Rahardja
Among them, learnable features, which are trained with separation networks jointly in an end-to-end fashion, become a new trend of modern speech separation research, e. g. convolutional time domain audio separation network (Conv-Tasnet), while handcrafted and parameterized features are also shown competitive in very recent studies.
Sound