Search Results for author: Jiangyu Han

Found 11 papers, 5 papers with code

DiaCorrect: Error Correction Back-end For Speaker Diarization

1 code implementation15 Sep 2023 Jiangyu Han, Federico Landini, Johan Rohdin, Mireia Diez, Lukas Burget, Yuhang Cao, Heng Lu, Jan Cernocky

In this work, we propose an error correction framework, named DiaCorrect, to refine the output of a diarization system in a simple yet effective way.

Automatic Speech Recognition speaker-diarization +3

Dynamic Acoustic Compensation and Adaptive Focal Training for Personalized Speech Enhancement

no code implementations22 Nov 2022 Xiaofeng Ge, Jiangyu Han, Haixin Guan, Yanhua Long

Recently, more and more personalized speech enhancement systems (PSE) with excellent performance have been proposed.

Speech Enhancement

Heterogeneous Separation Consistency Training for Adaptation of Unsupervised Speech Separation

no code implementations23 Apr 2022 Jiangyu Han, Yanhua Long

SCT follows a framework using two heterogeneous neural networks (HNNs) to produce high confidence pseudo labels of unlabeled real speech mixtures.

Speech Separation

PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement

no code implementations4 Mar 2022 Xiaofeng Ge, Jiangyu Han, Yanhua Long, Haixin Guan

Finally, we propose to integrate the loss of complex subband gain, SNR, pitch filtering strength, and an OA loss in a multi-objective learning manner to further improve the speech enhancement performance.

Speech Enhancement

DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction

1 code implementation27 Dec 2021 Jiangyu Han, Yanhua Long, Lukas Burget, Jan Cernocky

Particularly, we find that the Mixture-Remix fine-tuning with DPCCN significantly outperforms the TD-SpeakerBeam for unsupervised cross-domain TSE, with around 3. 5 dB SISNR improvement on target domain test set, without any source domain performance degradation.

Speech Extraction

Improving Channel Decorrelation for Multi-Channel Target Speech Extraction

no code implementations6 Jun 2021 Jiangyu Han, Wei Rao, Yannan Wang, Yanhua Long

Moreover, new combination strategies of the CD-based spatial information and target speaker adaptation of parallel encoder outputs are also investigated.

Speech Extraction

Multi-channel target speech extraction with channel decorrelation and target speaker adaptation

1 code implementation19 Oct 2020 Jiangyu Han, Xinyuan Zhou, Yanhua Long, Yijie Li

In this work, we propose two methods for exploiting the multi-channel spatial information to extract the target speech.

Speech Extraction Audio and Speech Processing

Attention-based scaling adaptation for target speech extraction

no code implementations19 Oct 2020 Jiangyu Han, Wei Rao, Yanhua Long, Jiaen Liang

Furthermore, by introducing a mixture embedding matrix pooling method, our proposed attention-based scaling adaptation (ASA) can exploit the target speaker clues in a more efficient way.

Speech Extraction

Cannot find the paper you are looking for? You can Submit a new open access paper.