Search Results for author: DeLiang Wang

Found 26 papers, 5 papers with code

Combined Generative and Predictive Modeling for Speech Super-resolution

no code implementations 25 Jan 2024 Heming Wang, Eric W. Healy, DeLiang Wang

Specifically, we employ a diffusion-based model that is conditioned on the output of a predictive model.

Super-Resolution

Leveraging Laryngograph Data for Robust Voicing Detection in Speech

1 code implementation 5 Dec 2023 Yixuan Zhang, Heming Wang, DeLiang Wang

Accurately detecting voiced intervals in speech signals is a critical step in pitch tracking and has numerous applications.

Multi-channel Conversational Speaker Separation via Neural Diarization

no code implementations 15 Nov 2023 Hassan Taherian, DeLiang Wang

To enhance ASR performance in conversational or meeting environments, continuous speaker separation (CSS) is commonly employed.

Automatic Speech Recognition (ASR) +2

NeuralKalman: A Learnable Kalman Filter for Acoustic Echo Cancellation

no code implementations 29 Jan 2023 Yixuan Zhang, Meng Yu, Hao Zhang, Dong Yu, DeLiang Wang

The robustness of the Kalman filter to double talk and its rapid convergence make it a popular approach for addressing acoustic echo cancellation (AEC) challenges.

Acoustic echo cancellation

Time-Domain Speech Enhancement for Robust Automatic Speech Recognition

no code implementations 24 Oct 2022 Yufeng Yang, Ashutosh Pandey, DeLiang Wang

However, speech enhancement has not been established as an effective frontend for robust automatic speech recognition (ASR) in noisy conditions compared to an ASR model trained on noisy speech directly.

Automatic Speech Recognition (ASR) +2

Neural Vocoder is All You Need for Speech Super-resolution

1 code implementation 28 Mar 2022 Haohe Liu, Woosung Choi, Xubo Liu, Qiuqiang Kong, Qiao Tian, DeLiang Wang

In this paper, we propose a neural vocoder based speech super-resolution method (NVSR) that can handle a variety of input resolutions and upsampling ratios.

Audio Super-Resolution, Bandwidth Extension +1

A Conformer Based Acoustic Model for Robust Automatic Speech Recognition

no code implementations 1 Mar 2022 Yufeng Yang, Peidong Wang, DeLiang Wang

The proposed model builds on the wide residual bi-directional long short-term memory network (WRBN) with utterance-wise dropout and iterative speaker adaptation, but employs a Conformer encoder instead of the recurrent network.

Automatic Speech Recognition (ASR) +1

Interpreting Deep Knowledge Tracing Model on EdNet Dataset

no code implementations 31 Oct 2021 DeLiang Wang, Yu Lu, Qinggang Meng, Penghe Chen

With more deep learning techniques being introduced into the knowledge tracing domain, the interpretability issue of the knowledge tracing models has aroused researchers' attention.

Knowledge Tracing

Location-based training for multi-channel talker-independent speaker separation

no code implementations 8 Oct 2021 Hassan Taherian, Ke Tan, DeLiang Wang

We further demonstrate the effectiveness of LBT for the separation of four and five concurrent speakers.

Speaker Separation

Multi-Channel and Multi-Microphone Acoustic Echo Cancellation Using A Deep Learning Based Approach

no code implementations 3 Mar 2021 Hao Zhang, DeLiang Wang

Building on the deep learning based acoustic echo cancellation (AEC) in the single-loudspeaker (single-channel) and single-microphone setup, this paper investigates multi-channel AEC (MCAEC) and multi-microphone AEC (MMAEC).

Acoustic echo cancellation

Speaker Separation Using Speaker Inventories and Estimated Speech

no code implementations 20 Oct 2020 Peidong Wang, Zhuo Chen, DeLiang Wang, Jinyu Li, Yifan Gong

We propose speaker separation using speaker inventories and estimated speech (SSUSIES), a framework leveraging speaker profiles and estimated speech for speaker separation.

Speaker Separation, Speech Extraction +2

Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation

2 code implementations 4 Oct 2020 Zhong-Qiu Wang, Peidong Wang, DeLiang Wang

Although our system is trained on simulated room impulse responses (RIR) based on a fixed number of microphones arranged in a given geometry, it generalizes well to a real array with the same geometry.

Speaker Separation, Speech Separation

Dense CNN with Self-Attention for Time-Domain Speech Enhancement

no code implementations 3 Sep 2020 Ashutosh Pandey, DeLiang Wang

Even though the proposed loss is based on magnitudes only, a constraint imposed by noise prediction ensures that the loss enhances both magnitude and phase.

Speech Enhancement

Towards Interpretable Deep Learning Models for Knowledge Tracing

no code implementations 13 May 2020 Yu Lu, DeLiang Wang, Qinggang Meng, Penghe Chen

We thus propose to adopt the post-hoc method to tackle the interpretability issue for deep learning based knowledge tracing (DLKT) models.

Knowledge Tracing

Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural Speaker Separation

1 code implementation 25 Apr 2019 Yuzhou Liu, DeLiang Wang

Simultaneous grouping is first performed in each time frame by separating the spectra of different speakers with a permutation-invariantly trained neural network.

Clustering, Speaker Separation +1

Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling

no code implementations 11 Mar 2019 Peidong Wang, Ke Tan, DeLiang Wang

In this study, we analyze the distortion problem, compare different acoustic models, and investigate a distortion-independent training scheme for monaural speech recognition.

Automatic Speech Recognition (ASR) +2

Deep Learning Based Phase Reconstruction for Speaker Separation: A Trigonometric Perspective

no code implementations 22 Nov 2018 Zhong-Qiu Wang, Ke Tan, DeLiang Wang

This study investigates phase reconstruction for deep learning based monaural talker-independent speaker separation in the short-time Fourier transform (STFT) domain.

Speaker Separation

End-to-End Speech Separation with Unfolded Iterative Phase Reconstruction

no code implementations 26 Apr 2018 Zhong-Qiu Wang, Jonathan Le Roux, DeLiang Wang, John R. Hershey

In addition, we train through unfolded iterations of a phase reconstruction algorithm, represented as a series of STFT and inverse STFT layers.

Speech Separation

Supervised Speech Separation Based on Deep Learning: An Overview

no code implementations 24 Aug 2017 DeLiang Wang, Jitong Chen

A more recent approach formulates speech separation as a supervised learning problem, where the discriminative patterns of speech, speakers, and background noise are learned from training data.

Speaker Separation, Speech Dereverberation +1

Incorporating Language Level Information into Acoustic Models

no code implementations 14 Dec 2016 Peidong Wang, DeLiang Wang

This paper proposed a class of novel Deep Recurrent Neural Networks which can incorporate language-level information into acoustic models.

Automatic Speech Recognition (ASR) +1

Recurrent Deep Stacking Networks for Speech Recognition

no code implementations 14 Dec 2016 Peidong Wang, Zhongqiu Wang, DeLiang Wang

This paper presented our work on applying Recurrent Deep Stacking Networks (RDSNs) to Robust Automatic Speech Recognition (ASR) tasks.

Automatic Speech Recognition (ASR) +1

Cocktail Party Processing via Structured Prediction

no code implementations NeurIPS 2012 Yuxuan Wang, DeLiang Wang

While human listeners excel at selectively attending to a conversation in a cocktail party, machine performance is still far inferior by comparison.

General Classification, Speech Separation +1
