Search Results for author: Mou Wang

Found 12 papers, 2 papers with code

Sound event localization and classification using WASN in Outdoor Environment

no code implementations • 29 Mar 2024 • Dongzhe Zhang, Jianfeng Chen, Jisheng Bai, Mou Wang

Moreover, methods using multiple microphone arrays often focus solely on source localization, neglecting the aspect of sound event classification.

Classification

Paper
Add Code

Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift

1 code implementation • 5 Feb 2024 • Jisheng Bai, Mou Wang, Haohe Liu, Han Yin, Yafei Jia, Siwei Huang, Yutong Du, Dongzhe Zhang, Dongyuan Shi, Woon-Seng Gan, Mark D. Plumbley, Susanto Rahardja, Bin Xiang, Jianfeng Chen

In addition, considering the abundance of unlabeled acoustic scene data in the real world, it is important to study the possible ways to utilize these unlabelled data.

Acoustic Scene Classification Scene Classification

Paper
Code

Sub-band and Full-band Interactive U-Net with DPRNN for Demixing Cross-talk Stereo Music

no code implementations • 11 Jan 2024 • Han Yin, Mou Wang, Jisheng Bai, Dongyuan Shi, Woon-Seng Gan, Jianfeng Chen

This paper presents a detailed description of our proposed methods for the ICASSP 2024 Cadenza Challenge.

Paper
Add Code

Interactive Dual-Conformer with Scene-Inspired Mask for Soft Sound Event Detection

no code implementations • 23 Nov 2023 • Han Yin, Jisheng Bai, Mou Wang, Dongyuan Shi, Woon-Seng Gan, Jianfeng Chen

In this paper, we first propose an interactive dual-conformer (IDC) module, in which a cross-interaction mechanism is applied to effectively exploit the information from soft labels.

Event Detection Sound Event Detection

Paper
Add Code

AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning

1 code implementation • 21 Nov 2023 • Jisheng Bai, Han Yin, Mou Wang, Dongyuan Shi, Woon-Seng Gan, Jianfeng Chen, Susanto Rahardja

This paper presents AudioLog, a large language models (LLMs)-powered audio logging system with hybrid token-semantic contrastive learning.

Acoustic Scene Classification Audio captioning +5

Paper
Code

Convolutional Recurrent Neural Network with Attention for 3D Speech Enhancement

no code implementations • 8 Jun 2023 • Han Yin, Jisheng Bai, Mou Wang, Siwei Huang, Yafei Jia, Jianfeng Chen

3D speech enhancement can effectively improve the auditory experience and plays a crucial role in augmented reality technology.

Denoising Speech Enhancement

Paper
Add Code

Solving 3D Radar Imaging Inverse Problems with a Multi-cognition Task-oriented Framework

no code implementations • 28 Nov 2022 • Xu Zhan, Xiaoling Zhang, Mou Wang, Jun Shi, Shunjun Wei, Tianjiao Zeng

Current methods obtain undifferentiated results that suffer task-depended information retrieval loss and thus don't meet the task's specific demands well.

Information Retrieval Retrieval

Paper
Add Code

SSDPT: Self-Supervised Dual-Path Transformer for Anomalous Sound Detection in Machine Condition Monitoring

no code implementations • 6 Aug 2022 • Jisheng Bai, Jianfeng Chen, Mou Wang, Muhammad Saad Ayub, Qingli Yan

In this article, we propose a self-supervised dual-path Transformer (SSDPT) network to detect anomalous sounds in machine monitoring.

Self-Supervised Learning

Paper
Add Code

A Squeeze-and-Excitation and Transformer based Cross-task System for Environmental Sound Recognition

no code implementations • 16 Mar 2022 • Jisheng Bai, Jianfeng Chen, Mou Wang, Muhammad Saad Ayub

Evaluations for the three tasks are conducted on the recent databases of detection and classification of acoustic scenes and event challenges.

Acoustic Scene Classification Data Augmentation +1

Paper
Add Code

A comparison of handcrafted, parameterized, and learnable features for speech separation

no code implementations • 29 Nov 2020 • Wenbo Zhu, Mou Wang, Xiao-Lei Zhang, Susanto Rahardja

Among them, learnable features, which are trained with separation networks jointly in an end-to-end fashion, become a new trend of modern speech separation research, e. g. convolutional time domain audio separation network (Conv-Tasnet), while handcrafted and parameterized features are also shown competitive in very recent studies.

Sound

Paper
Add Code

Multimodal Urban Sound Tagging with Spatiotemporal Context

no code implementations • 31 Oct 2020 • Jisheng Bai, Jianfeng Chen, Mou Wang

Noise pollution significantly affects our daily life and urban development.

Paper
Add Code

Hyperspectral Unmixing via Deep Autoencoder Networks for a Generalized Linear-Mixture/Nonlinear-Fluctuation Model

no code implementations • 30 Apr 2019 • Min Zhao, Mou Wang, Jie Chen, Susanto Rahardja

This paper presents an unsupervised nonlinear spectral unmixing method based on a deep autoencoder network that applies to a generalized linear-mixture/nonlinear fluctuation model, consisting of a linear mixture component and an additive nonlinear mixture component that depends on both endmembers and abundances.

Hyperspectral Unmixing

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.