no code implementations • 7 Mar 2023 • Kang Li, Yan Song, Li-Rong Dai, Ian McLoughlin, Xin Fang, Lin Liu
In this paper, we propose an effective sound event detection (SED) method, termed AST-SED, based on the audio spectrogram transformer (AST) model pretrained on the large-scale AudioSet for the audio tagging (AT) task.
no code implementations • 2 Nov 2022 • Jie Bai, Xin Fang, Jianwu Fang, Jianru Xue, Changwei Yuan
To this end, we formulate a deep virtual to real distillation framework by introducing the synthetic data that can be generated conveniently, and borrow the abundant information of pedestrian movement in synthetic videos for the pedestrian crossing prediction in real data with a simple and lightweight implementation.
no code implementations • 20 Sep 2022 • Shuan Dong, Xin Fang, Jin Tan, Ningchao Gao, Xiaofan Cui, Anderson Hoke
The simulation results in the IEEE 39-bus system with different types of FFR demonstrate that the proposed method provides an accurate and fast prediction of the frequency nadir under various disturbances.
no code implementations • 27 May 2022 • Xiaofei Wang, Fangxing Li, Linquan Bai, Xin Fang
The DLMP provides a solution that can be essential for competitive market operation in future distribution systems.
no code implementations • 5 Apr 2022 • Ye-Qian Du, Jie Zhang, Qiu-Shi Zhu, Li-Rong Dai, Ming-Hui Wu, Xin Fang, Zhou-Wang Yang
Unpaired data has been shown to be beneficial for low-resource automatic speech recognition (ASR), where it can be used in the design of hybrid models with multi-task training or language-model-dependent pre-training.
Automatic Speech Recognition (ASR)
no code implementations • 15 Feb 2022 • Zi-Qiang Zhang, Jie Zhang, Jian-Shu Zhang, Ming-Hui Wu, Xin Fang, Li-Rong Dai
The proposed approach explores both the complementarity of audio-visual modalities and long-term context dependency using a transformer-based fusion module and a flexible masking strategy.
no code implementations • 22 Jan 2022 • Qiu-Shi Zhu, Jie Zhang, Zi-Qiang Zhang, Ming-Hui Wu, Xin Fang, Li-Rong Dai
In this work, we therefore first analyze the noise robustness of wav2vec2.0 via experiments.
Automatic Speech Recognition (ASR)
no code implementations • 7 May 2021 • Wenbo Wang, Xin Fang, Anthony Florita
Distributed energy resource (DER) frequency regulations are promising technologies for future grid operation.
no code implementations • 19 Mar 2021 • Yuxuan Wang, Maokui He, Shutong Niu, Lei Sun, Tian Gao, Xin Fang, Jia Pan, Jun Du, Chin-Hui Lee
This system description presents our submission to the Third DIHARD Speech Diarization Challenge.
no code implementations • 15 Mar 2021 • Zi-Qiang Zhang, Yan Song, Ming-Hui Wu, Xin Fang, Li-Rong Dai
In this paper, we propose a weakly supervised multilingual representation learning framework, called cross-lingual self-training (XLST).
no code implementations • 24 Nov 2020 • Hantao Cui, Fangxing Li, Xin Fang
This letter investigates parallelism approaches for equation and Jacobian evaluations in large-scale power flow calculation.