Search Results for author: Boqing Zhu

Found 5 papers, 3 papers with code

Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast

1 code implementation • 28 Apr 2022 • Boqing Zhu, Kele Xu, Changjian Wang, Zheng Qin, Tao Sun, Huaimin Wang, Yuxing Peng

We present an approach to learn voice-face representations from talking-face videos, without any identity labels.

Contrastive Learning • Representation Learning

Audio Tagging by Cross Filtering Noisy Labels

no code implementations • 16 Jul 2020 • Boqing Zhu, Kele Xu, Qiuqiang Kong, Huaimin Wang, Yuxing Peng

Yet it is labor-intensive to accurately annotate large amounts of audio data, and in practical settings the dataset may contain noisy labels.

Audio Tagging • Memorization • +1

General audio tagging with ensembling convolutional neural network and statistical features

2 code implementations • 30 Oct 2018 • Kele Xu, Boqing Zhu, Qiuqiang Kong, Haibo Mi, Bo Ding, Dezhi Wang, Huaimin Wang

Audio tagging is challenging due to the limited size of data and noisy labels.

Audio Tagging • Descriptive • +2

Mixup-Based Acoustic Scene Classification Using Multi-Channel Convolutional Neural Network

no code implementations • 18 May 2018 • Kele Xu, Dawei Feng, Haibo Mi, Boqing Zhu, Dezhi Wang, Lilun Zhang, Hengxing Cai, Shuwen Liu

Audio scene classification, the problem of predicting class labels of audio scenes, has drawn considerable attention in recent years.

Acoustic Scene Classification • Classification • +2

Learning Environmental Sounds with Multi-scale Convolutional Neural Network

1 code implementation • 25 Mar 2018 • Boqing Zhu, Changjian Wang, Feng Liu, Jin Lei, Zengquan Lu, Yuxing Peng

To leverage waveform-based and spectrogram-based features in a single model, we introduce a two-phase method to fuse the different features.

Sound • Audio and Speech Processing
