Search Results for author: Shilei Zhang

Found 11 papers, 2 papers with code

Plugin Speech Enhancement: A Universal Speech Enhancement Framework Inspired by Dynamic Neural Network

no code implementations • 20 Feb 2024 • Yanan Chen, Zihao Cui, Yingying Gao, Junlan Feng, Chao Deng, Shilei Zhang

In this study, we present a novel weighting prediction approach, which explicitly learns the task relationships from downstream training information to address the core challenge of universal speech enhancement.

Data Augmentation, Speech Enhancement

GenDistiller: Distilling Pre-trained Language Models based on Generative Models

no code implementations • 20 Oct 2023 • Yingying Gao, Shilei Zhang, Zihao Cui, Yanhan Xu, Chao Deng, Junlan Feng

Self-supervised pre-trained models such as HuBERT and WavLM leverage unlabeled speech data for representation learning and offer significant improvements for numerous downstream tasks.

Knowledge Distillation, Language Modelling +1

MFAS: Emotion Recognition through Multiple Perspectives Fusion Architecture Search Emulating Human Cognition

no code implementations • 12 Jun 2023 • Haiyang Sun, FuLin Zhang, Zheng Lian, Yingying Guo, Shilei Zhang

Additionally, considering that humans adjust their perception of emotional words in textual semantics based on certain cues present in speech, we design a novel search space and search for the optimal fusion strategy for the two types of information.

Quantization, Speech Emotion Recognition

Meta Auxiliary Learning for Low-resource Spoken Language Understanding

no code implementations • 26 Jun 2022 • Yingying Gao, Junlan Feng, Chao Deng, Shilei Zhang

Spoken language understanding (SLU) treats automatic speech recognition (ASR) and natural language understanding (NLU) as a unified task and usually suffers from data scarcity.

Automatic Speech Recognition, Automatic Speech Recognition (ASR) +4

Multiple Confidence Gates For Joint Training Of SE And ASR

no code implementations • 1 Apr 2022 • Tianrui Wang, Weibin Zhu, Yingying Gao, Junlan Feng, Shilei Zhang

Joint training of a speech enhancement (SE) model and a speech recognition (ASR) model is a common solution for robust ASR in noisy environments.

Robust Speech Recognition, Speech Enhancement +1

Harmonic gated compensation network plus for ICASSP 2022 DNS CHALLENGE

no code implementations • 25 Feb 2022 • Tianrui Wang, Weibin Zhu, Yingying Gao, Yanan Chen, Junlan Feng, Shilei Zhang

Therefore, we previously proposed a harmonic gated compensation network (HGCN) to predict the full harmonic locations based on the unmasked harmonics and process the result of a coarse enhancement module to recover the masked harmonics.

HGCN: Harmonic gated compensation network for speech enhancement

1 code implementation • 30 Jan 2022 • Tianrui Wang, Weibin Zhu, Yingying Gao, Junlan Feng, Shilei Zhang

Mask processing in the time-frequency (T-F) domain through a neural network has been one of the mainstream approaches to single-channel speech enhancement.

Action Detection, Activity Detection +1

Identity-Enhanced Network for Facial Expression Recognition

no code implementations • 11 Dec 2018 • Yanwei Li, Xingang Wang, Shilei Zhang, Lingxi Xie, Wenqi Wu, Hongyuan Yu, Zheng Zhu

Facial expression recognition is a challenging task, arguably because of large intra-class variations and high inter-class similarities.

Facial Expression Recognition, Facial Expression Recognition (FER) +1
