Search Results for author: Tianrui Wang

Found 10 papers, 3 papers with code

A Refining Underlying Information Framework for Monaural Speech Enhancement

1 code implementation • 18 Dec 2023 • Rui Cao, Tianrui Wang, Meng Ge, Longbiao Wang, Jianwu Dang

By bridging the speech enhancement and the Information Bottleneck principle in this letter, we rethink a universal plug-and-play strategy and propose a Refining Underlying Information framework called RUI to rise to the challenges both in theory and practice.

Speech Enhancement

Paper
Code

On decoder-only architecture for speech-to-text and large language model integration

no code implementations • 8 Jul 2023 • Jian Wu, Yashesh Gaur, Zhuo Chen, Long Zhou, Yimeng Zhu, Tianrui Wang, Jinyu Li, Shujie Liu, Bo Ren, Linquan Liu, Yu Wu

Large language models (LLMs) have achieved remarkable success in the field of natural language processing, enabling better human-computer interaction using natural language.

Language Modelling Large Language Model +1

Paper
Add Code

VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation

no code implementations • 25 May 2023 • Tianrui Wang, Long Zhou, Ziqiang Zhang, Yu Wu, Shujie Liu, Yashesh Gaur, Zhuo Chen, Jinyu Li, Furu Wei

Recent research shows a big convergence in model architecture, training objectives, and inference methods across various tasks for different modalities.

Language Modelling Multi-Task Learning +3

Paper
Add Code

An Adapter based Multi-label Pre-training for Speech Separation and Enhancement

no code implementations • 11 Nov 2022 • Tianrui Wang, Xie Chen, Zhuo Chen, Shu Yu, Weibin Zhu

In recent years, self-supervised learning (SSL) has achieved tremendous success in various speech tasks due to its power to extract representations from massive unlabeled data.

Denoising Pseudo Label +4

Paper
Add Code

A CTC Triggered Siamese Network with Spatial-Temporal Dropout for Speech Recognition

no code implementations • 16 Jun 2022 • Yingying Gao, Junlan Feng, Tianrui Wang, Chao Deng, Shilei Zhang

Analysis shows that our proposed approach brings a better uniformity for the trained model and enlarges the CTC spikes obviously.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Multiple Confidence Gates For Joint Training Of SE And ASR

no code implementations • 1 Apr 2022 • Tianrui Wang, Weibin Zhu, Yingying Gao, Junlan Feng, Shilei Zhang

Joint training of speech enhancement model (SE) and speech recognition model (ASR) is a common solution for robust ASR in noisy environments.

Robust Speech Recognition Speech Enhancement +1

Paper
Add Code

Harmonic gated compensation network plus for ICASSP 2022 DNS CHALLENGE

no code implementations • 25 Feb 2022 • Tianrui Wang, Weibin Zhu, Yingying Gao, Yanan Chen, Junlan Feng, Shilei Zhang

Therefore, we previously proposed a harmonic gated compensation network (HGCN) to predict the full harmonic locations based on the unmasked harmonics and process the result of a coarse enhancement module to recover the masked harmonics.

Paper
Add Code

HGCN: Harmonic gated compensation network for speech enhancement

1 code implementation • 30 Jan 2022 • Tianrui Wang, Weibin Zhu, Yingying Gao, Junlan Feng, Shilei Zhang

Mask processing in the time-frequency (T-F) domain through the neural network has been one of the mainstreams for single-channel speech enhancement.

Action Detection Activity Detection +1

Paper
Code

A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement

1 code implementation • 26 Aug 2021 • Tianrui Wang, Weibin Zhu

Deep learning technology has been widely applied to speech enhancement.

Speech Enhancement

Paper
Code

Is Image Encoding Beneficial for Deep Learning in Finance? An Analysis of Image Encoding Methods for the Application of Convolutional Neural Networks in Finance

no code implementations • 17 Oct 2020 • Dan Wang, Tianrui Wang, Ionuţ Florescu

We find that using imaging techniques to input data for CNN works better for financial ratio data but is not significantly better than simply using the 1D input directly for fundamental data.

Retrieval

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.