1 code implementation • 28 Apr 2022 • Boqing Zhu, Kele Xu, Changjian Wang, Zheng Qin, Tao Sun, Huaimin Wang, Yuxing Peng
We present an approach to learn voice-face representations from the talking face videos, without any identity labels.
no code implementations • 16 Jul 2020 • Boqing Zhu, Kele Xu, Qiuqiang Kong, Huaimin Wang, Yuxing Peng
Yet, it is labor-intensive to accurately annotate large amount of audio data, and the dataset may contain noisy labels in the practical settings.
2 code implementations • 30 Oct 2018 • Kele Xu, Boqing Zhu, Qiuqiang Kong, Haibo Mi, Bo Ding, Dezhi Wang, Huaimin Wang
Audio tagging is challenging due to the limited size of data and noisy labels.
no code implementations • 18 May 2018 • Kele Xu, Dawei Feng, Haibo Mi, Boqing Zhu, Dezhi Wang, Lilun Zhang, Hengxing Cai, Shuwen Liu
Audio scene classification, the problem of predicting class labels of audio scenes, has drawn lots of attention during the last several years.
1 code implementation • 25 Mar 2018 • Boqing Zhu, Changjian Wang, Feng Liu, Jin Lei, Zengquan Lu, Yuxing Peng
For leveraging the waveform-based features and spectrogram-based features in a single model, we introduce two-phase method to fuse the different features.
Sound Audio and Speech Processing