no code implementations • 29 Mar 2024 • Dongzhe Zhang, Jianfeng Chen, Jisheng Bai, Mou Wang
Moreover, methods using multiple microphone arrays often focus solely on source localization, neglecting the aspect of sound event classification.
1 code implementation • 5 Feb 2024 • Jisheng Bai, Mou Wang, Haohe Liu, Han Yin, Yafei Jia, Siwei Huang, Yutong Du, Dongzhe Zhang, Dongyuan Shi, Woon-Seng Gan, Mark D. Plumbley, Susanto Rahardja, Bin Xiang, Jianfeng Chen
In addition, considering the abundance of unlabeled acoustic scene data in the real world, it is important to study the possible ways to utilize these unlabelled data.
no code implementations • 11 Jan 2024 • Han Yin, Mou Wang, Jisheng Bai, Dongyuan Shi, Woon-Seng Gan, Jianfeng Chen
This paper presents a detailed description of our proposed methods for the ICASSP 2024 Cadenza Challenge.
no code implementations • 23 Nov 2023 • Han Yin, Jisheng Bai, Mou Wang, Dongyuan Shi, Woon-Seng Gan, Jianfeng Chen
In this paper, we first propose an interactive dual-conformer (IDC) module, in which a cross-interaction mechanism is applied to effectively exploit the information from soft labels.
1 code implementation • 21 Nov 2023 • Jisheng Bai, Han Yin, Mou Wang, Dongyuan Shi, Woon-Seng Gan, Jianfeng Chen, Susanto Rahardja
This paper presents AudioLog, a large language models (LLMs)-powered audio logging system with hybrid token-semantic contrastive learning.
no code implementations • 8 Jun 2023 • Han Yin, Jisheng Bai, Mou Wang, Siwei Huang, Yafei Jia, Jianfeng Chen
3D speech enhancement can effectively improve the auditory experience and plays a crucial role in augmented reality technology.
no code implementations • 28 Nov 2022 • Xu Zhan, Xiaoling Zhang, Mou Wang, Jun Shi, Shunjun Wei, Tianjiao Zeng
Current methods obtain undifferentiated results that suffer task-depended information retrieval loss and thus don't meet the task's specific demands well.
no code implementations • 6 Aug 2022 • Jisheng Bai, Jianfeng Chen, Mou Wang, Muhammad Saad Ayub, Qingli Yan
In this article, we propose a self-supervised dual-path Transformer (SSDPT) network to detect anomalous sounds in machine monitoring.
no code implementations • 16 Mar 2022 • Jisheng Bai, Jianfeng Chen, Mou Wang, Muhammad Saad Ayub
Evaluations for the three tasks are conducted on the recent databases of detection and classification of acoustic scenes and event challenges.
no code implementations • 29 Nov 2020 • Wenbo Zhu, Mou Wang, Xiao-Lei Zhang, Susanto Rahardja
Among them, learnable features, which are trained with separation networks jointly in an end-to-end fashion, become a new trend of modern speech separation research, e. g. convolutional time domain audio separation network (Conv-Tasnet), while handcrafted and parameterized features are also shown competitive in very recent studies.
Sound
no code implementations • 31 Oct 2020 • Jisheng Bai, Jianfeng Chen, Mou Wang
Noise pollution significantly affects our daily life and urban development.
no code implementations • 30 Apr 2019 • Min Zhao, Mou Wang, Jie Chen, Susanto Rahardja
This paper presents an unsupervised nonlinear spectral unmixing method based on a deep autoencoder network that applies to a generalized linear-mixture/nonlinear fluctuation model, consisting of a linear mixture component and an additive nonlinear mixture component that depends on both endmembers and abundances.