no code implementations • 6 Dec 2023 • Yanxiong Li, Zhongjie Jiang, Qisheng Huang, Wenchang Cao, Jialong Li
The features that are output from current block of the model are processed according to the steps above before they are fed into the next block of the model.
no code implementations • 9 Jun 2023 • Yufei Zeng, Yanxiong Li, Zhenfeng Zhou, Ruiqi Wang, Difeng Lu
Domestic activities classification (DAC) from audio recordings aims at classifying audio recordings into pre-defined categories of domestic activities, which is an effective way for estimation of daily activities performed in home environment.
no code implementations • 9 Jun 2023 • Yanxiong Li, Mingle Liu, Wucheng Wang, Yuhan Zhang, Qianhua He
In this study, we propose a method for acoustic scene clustering that jointly optimizes the procedures of feature learning and clustering iteration.
no code implementations • 3 Jun 2023 • Yanxiong Li, Wenchang Cao, Wei Xie, Qisheng Huang, Wenfeng Pang, Qianhua He
This subtask focuses on classifying audio samples of multiple devices with a low-complexity model, where two main difficulties need to be overcome.
1 code implementation • 3 Jun 2023 • Yanxiong Li, Wenchang Cao, Jialong Li, Wei Xie, Qianhua He
It is generally assumed that number of classes is fixed in current audio classification methods, and the model can recognize pregiven classes only.
no code implementations • 1 Jun 2023 • Yanxiong Li, Zhongjie Jiang, Wenchang Cao, Qisheng Huang
The proposed method is compared with state-of-the-art methods for speaker verification.
no code implementations • 31 May 2023 • Yanxiong Li, Hao Chen, Wenchang Cao, Qisheng Huang, Qianhua He
In the proposed embedding module, audio feature of each speech sample is split into several low-dimensional feature subsets that are transformed by a recurrent convolutional block in parallel.
1 code implementation • 31 May 2023 • Yanxiong Li, Wenchang Cao, Wei Xie, Jialong Li, Emmanouil Benetos
Labeled support samples and unlabeled query samples are used to train the prototype adaptation network and update the classifier, since they are informative for audio classification.
1 code implementation • 4 Aug 2022 • Yanxiong Li, Wenchang Cao, Konstantinos Drossos, Tuomas Virtanen
Automatic estimation of domestic activities from audio can be used to solve many problems, such as reducing the labor cost for nursing the elderly people.
no code implementations • 24 Apr 2022 • Yanxiong Li, Wucheng Wang, Hao Chen, Wenchang Cao, Wei Li, Qianhua He
Although few-shot learning has attracted much attention from the fields of image and audio classification, few efforts have been made on few-shot speaker identification.
no code implementations • 8 May 2021 • Ziheng Lin, Yanxiong Li, Zhangjin Huang, WenHao Zhang, Yufeng Tan, YiChun Chen, Qianhua He
Domestic activities clustering from audio recordings aims at merging audio clips which belong to the same class of domestic activity into a single cluster.
1 code implementation • 2 Feb 2020 • Konstantinos Drossos, Stylianos I. Mimilakis, Shayan Gharib, Yanxiong Li, Tuomas Virtanen
The number of the channels of the CNNs and size of the weight matrices of the RNNs have a direct effect on the total amount of parameters of the SED method, which is to a couple of millions.