Search Results for author: Ying Fang

Found 10 papers, 5 papers with code

CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR

1 code implementation • 27 Feb 2025 • Nian Shao, Rui Zhou, Pengyu Wang, Xian Li, Ying Fang, Yujie Yang, Xiaofei Li

Compared to linear-frequency domain or time-domain speech enhancement, the key advantage of Mel-spectrogram enhancement is that Mel-frequency presents speech in a more compact way and thus is easier to learn, which will benefit both speech quality and ASR.

Automatic Speech Recognition (ASR), +3 more

VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverberation and Blind RIR Identification

1 code implementation • 11 Feb 2025 • Pengyu Wang, Ying Fang, Xiaofei Li

Reverberant speech, denoting the speech signal degraded by the process of reverberation, contains crucial knowledge of both anechoic source speech and room impulse response (RIR).

Automatic Speech Recognition (ASR), +4 more

Evaluating the Design Features of an Intelligent Tutoring System for Advanced Mathematics Learning

no code implementations • 23 Dec 2024 • Ying Fang, Bo He, Zhi Liu, Sannyuya Liu, Zhonghua Yan, Jianwen Sun

Xiaomai is an intelligent tutoring system (ITS) designed to help Chinese college students in learning advanced mathematics and preparing for the graduate school math entrance exam.

Math

Converting High-Performance and Low-Latency SNNs through Explicit Modelling of Residual Error in ANNs

no code implementations • 26 Apr 2024 • Zhipeng Huang, Jianhao Ding, Zhiyu Pan, Haoran Li, Ying Fang, Zhaofei Yu, Jian K. Liu

One of the mainstream approaches to implementing deep SNNs is the ANN-SNN conversion, which integrates the efficient training strategy of ANNs with the energy-saving potential and fast inference capability of SNNs.

Mel-FullSubNet: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR

no code implementations • 21 Feb 2024 • Rui Zhou, Xian Li, Ying Fang, Xiaofei Li

In this work, we propose Mel-FullSubNet, a single-channel Mel-spectrogram denoising and dereverberation network for improving both speech quality and automatic speech recognition (ASR) performance.

Automatic Speech Recognition (ASR), +3 more

Unimodal Aggregation for CTC-based Speech Recognition

1 code implementation • 15 Sep 2023 • Ying Fang, Xiaofei Li

Then, the feature frames with unimodal weights are integrated and further processed by a decoder.

Automatic Speech Recognition, Decoder, +1 more

Biologically Plausible Variational Policy Gradient with Spiking Recurrent Winner-Take-All Networks

1 code implementation • 21 Oct 2022 • Zhile Yang, Shangqi Guo, Ying Fang, Jian K. Liu

One stream of reinforcement learning research is exploring biologically plausible models and algorithms to simulate biological intelligence and fit neuromorphic hardware.

All, Reinforcement Learning (RL)
