Search Results for author: DeLiang Wang

Found 26 papers, 5 papers with code

Combined Generative and Predictive Modeling for Speech Super-resolution

no code implementations • 25 Jan 2024 • Heming Wang, Eric W. Healy, DeLiang Wang

Specifically, we employ a diffusion-based model that is conditioned on the output of a predictive model.

Paper
Add Code

Leveraging Laryngograph Data for Robust Voicing Detection in Speech

1 code implementation • 5 Dec 2023 • Yixuan Zhang, Heming Wang, DeLiang Wang

Accurately detecting voiced intervals in speech signals is a critical step in pitch tracking and has numerous applications.

Paper
Code

Multi-channel Conversational Speaker Separation via Neural Diarization

no code implementations • 15 Nov 2023 • Hassan Taherian, DeLiang Wang

To enhance ASR performance in conversational or meeting environments, continuous speaker separation (CSS) is commonly employed.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

NeuralKalman: A Learnable Kalman Filter for Acoustic Echo Cancellation

no code implementations • 29 Jan 2023 • Yixuan Zhang, Meng Yu, Hao Zhang, Dong Yu, DeLiang Wang

The robustness of the Kalman filter to double talk and its rapid convergence make it a popular approach for addressing acoustic echo cancellation (AEC) challenges.

Acoustic echo cancellation

Paper
Add Code

Multi-resolution location-based training for multi-channel continuous speech separation

no code implementations • 16 Jan 2023 • Hassan Taherian, DeLiang Wang

The performance of automatic speech recognition (ASR) systems severely degrades when multi-talker speech overlap occurs.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

Time-Domain Speech Enhancement for Robust Automatic Speech Recognition

no code implementations • 24 Oct 2022 • Yufeng Yang, Ashutosh Pandey, DeLiang Wang

However, speech enhancement has not been established as an effective frontend for robust automatic speech recognition (ASR) in noisy conditions compared to an ASR model trained on noisy speech directly.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration

1 code implementation • 12 Apr 2022 • Haohe Liu, Xubo Liu, Qiuqiang Kong, Qiao Tian, Yan Zhao, DeLiang Wang, Chuanzeng Huang, Yuxuan Wang

Speech restoration aims to remove distortions in speech signals.

Speech Denoising Speech Enhancement +1

915

Paper
Code

Neural Vocoder is All You Need for Speech Super-resolution

1 code implementation • 28 Mar 2022 • Haohe Liu, Woosung Choi, Xubo Liu, Qiuqiang Kong, Qiao Tian, DeLiang Wang

In this paper, we propose a neural vocoder based speech super-resolution method (NVSR) that can handle a variety of input resolution and upsampling ratios.

Ranked #2 on Audio Super-Resolution on VCTK Multi-Speaker

Audio Super-Resolution Bandwidth Extension +1

119

Paper
Code

A Conformer Based Acoustic Model for Robust Automatic Speech Recognition

no code implementations • 1 Mar 2022 • Yufeng Yang, Peidong Wang, DeLiang Wang

The proposed model builds on the wide residual bi-directional long short-term memory network (WRBN) with utterance-wise dropout and iterative speaker adaptation, but employs a Conformer encoder instead of the recurrent network.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Interpreting Deep Knowledge Tracing Model on EdNet Dataset

no code implementations • 31 Oct 2021 • DeLiang Wang, Yu Lu, Qinggang Meng, Penghe Chen

With more deep learning techniques being introduced into the knowledge tracing domain, the interpretability issue of the knowledge tracing models has aroused researchers' attention.

Knowledge Tracing

Paper
Add Code

Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction

no code implementations • 28 Oct 2021 • Heming Wang, Yao Qian, Xiaofei Wang, Yiming Wang, Chengyi Wang, Shujie Liu, Takuya Yoshioka, Jinyu Li, DeLiang Wang

The reconstruction module is used for auxiliary learning to improve the noise robustness of the learned representation and thus is not required during inference.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +8

Paper
Add Code

Location-based training for multi-channel talker-independent speaker separation

no code implementations • 8 Oct 2021 • Hassan Taherian, Ke Tan, DeLiang Wang

We further demonstrate the effectiveness of LBT for the separation of four and five concurrent speakers.

Speaker Separation

Paper
Add Code

Multi-Channel and Multi-Microphone Acoustic Echo Cancellation Using A Deep Learning Based Approach

no code implementations • 3 Mar 2021 • Hao Zhang, DeLiang Wang

Building on the deep learning based acoustic echo cancellation (AEC) in the single-loudspeaker (single-channel) and single-microphone setup, this paper investigates multi-channel AEC (MCAEC) and multi-microphone AEC (MMAEC).

Acoustic echo cancellation

Paper
Add Code

Efficient End-to-End Speech Recognition Using Performers in Conformers

no code implementations • 9 Nov 2020 • Peidong Wang, DeLiang Wang

On-device end-to-end speech recognition poses a high requirement on model efficiency.

speech-recognition Speech Recognition

Paper
Add Code

Speaker Separation Using Speaker Inventories and Estimated Speech

no code implementations • 20 Oct 2020 • Peidong Wang, Zhuo Chen, DeLiang Wang, Jinyu Li, Yifan Gong

We propose speaker separation using speaker inventories and estimated speech (SSUSIES), a framework leveraging speaker profiles and estimated speech for speaker separation.

Speaker Separation Speech Extraction +2

Paper
Add Code

Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation

2 code implementations • 4 Oct 2020 • Zhong-Qiu Wang, Peidong Wang, DeLiang Wang

Although our system is trained on simulated room impulse responses (RIR) based on a fixed number of microphones arranged in a given geometry, it generalizes well to a real array with the same geometry.

Speaker Separation Speech Separation

Paper
Code

Dense CNN with Self-Attention for Time-Domain Speech Enhancement

no code implementations • 3 Sep 2020 • Ashutosh Pandey, DeLiang Wang

Even though the proposed loss is based on magnitudes only, a constraint imposed by noise prediction ensures that the loss enhances both magnitude and phase.

Speech Enhancement

Paper
Add Code

Towards Interpretable Deep Learning Models for Knowledge Tracing

no code implementations • 13 May 2020 • Yu Lu, DeLiang Wang, Qinggang Meng, Penghe Chen

We thus propose to adopt the post-hoc method to tackle the interpretability issue for deep learning based knowledge tracing (DLKT) models.

Knowledge Tracing

Paper
Add Code

Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural Speaker Separation

1 code implementation • 25 Apr 2019 • Yuzhou Liu, DeLiang Wang

Simultaneous grouping is first performed in each time frame by separating the spectra of different speakers with a permutation-invariantly trained neural network.

Ranked #21 on Speech Separation on WSJ0-2mix

Clustering Speaker Separation +1

Paper
Code

Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling

no code implementations • 11 Mar 2019 • Peidong Wang, Ke Tan, DeLiang Wang

In this study, we analyze the distortion problem, compare different acoustic models, and investigate a distortion-independent training scheme for monaural speech recognition.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective

no code implementations • 22 Nov 2018 • Zhong-Qiu Wang, Ke Tan, DeLiang Wang

This study investigates phase reconstruction for deep learning based monaural talker-independent speaker separation in the short-time Fourier transform (STFT) domain.

Speaker Separation

Paper
Add Code

End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction

no code implementations • 26 Apr 2018 • Zhong-Qiu Wang, Jonathan Le Roux, DeLiang Wang, John R. Hershey

In addition, we train through unfolded iterations of a phase reconstruction algorithm, represented as a series of STFT and inverse STFT layers.

Speech Separation

Paper
Add Code

Supervised Speech Separation Based on Deep Learning: An Overview

no code implementations • 24 Aug 2017 • DeLiang Wang, Jitong Chen

A more recent approach formulates speech separation as a supervised learning problem, where the discriminative patterns of speech, speakers, and background noise are learned from training data.

Speaker Separation Speech Dereverberation +1

Paper
Add Code

Incorporating Language Level Information into Acoustic Models

no code implementations • 14 Dec 2016 • Peidong Wang, DeLiang Wang

This paper proposed a class of novel Deep Recurrent Neural Networks which can incorporate language-level information into acoustic models.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Recurrent Deep Stacking Networks for Speech Recognition

no code implementations • 14 Dec 2016 • Peidong Wang, Zhongqiu Wang, DeLiang Wang

This paper presented our work on applying Recurrent Deep Stacking Networks (RDSNs) to Robust Automatic Speech Recognition (ASR) tasks.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Cocktail Party Processing via Structured Prediction

no code implementations • NeurIPS 2012 • Yuxuan Wang, DeLiang Wang

While human listeners excel at selectively attending to a conversation in a cocktail party, machine performance is still far inferior by comparison.

General Classification Speech Separation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.