Search Results for author: Tianrui Wang

Found 10 papers, 3 papers with code

A Refining Underlying Information Framework for Monaural Speech Enhancement

1 code implementation • 18 Dec 2023 • Rui Cao, Tianrui Wang, Meng Ge, Longbiao Wang, Jianwu Dang

By bridging the speech enhancement and the Information Bottleneck principle in this letter, we rethink a universal plug-and-play strategy and propose a Refining Underlying Information framework called RUI to rise to the challenges both in theory and practice.

Speech Enhancement

On decoder-only architecture for speech-to-text and large language model integration

no code implementations • 8 Jul 2023 • Jian Wu, Yashesh Gaur, Zhuo Chen, Long Zhou, Yimeng Zhu, Tianrui Wang, Jinyu Li, Shujie Liu, Bo Ren, Linquan Liu, Yu Wu

Large language models (LLMs) have achieved remarkable success in the field of natural language processing, enabling better human-computer interaction using natural language.

Language Modelling, Large Language Model, +1

VioLA: Unified Codec Language Models for Speech Recognition, Synthesis, and Translation

no code implementations • 25 May 2023 • Tianrui Wang, Long Zhou, Ziqiang Zhang, Yu Wu, Shujie Liu, Yashesh Gaur, Zhuo Chen, Jinyu Li, Furu Wei

Recent research shows a big convergence in model architecture, training objectives, and inference methods across various tasks for different modalities.

Language Modelling, Multi-Task Learning, +3

An Adapter based Multi-label Pre-training for Speech Separation and Enhancement

no code implementations • 11 Nov 2022 • Tianrui Wang, Xie Chen, Zhuo Chen, Shu Yu, Weibin Zhu

In recent years, self-supervised learning (SSL) has achieved tremendous success in various speech tasks due to its power to extract representations from massive unlabeled data.

Denoising, Pseudo Label, +4

Multiple Confidence Gates For Joint Training Of SE And ASR

no code implementations • 1 Apr 2022 • Tianrui Wang, Weibin Zhu, Yingying Gao, Junlan Feng, Shilei Zhang

Joint training of speech enhancement model (SE) and speech recognition model (ASR) is a common solution for robust ASR in noisy environments.

Robust Speech Recognition, Speech Enhancement, +1

Harmonic gated compensation network plus for ICASSP 2022 DNS CHALLENGE

no code implementations • 25 Feb 2022 • Tianrui Wang, Weibin Zhu, Yingying Gao, Yanan Chen, Junlan Feng, Shilei Zhang

We previously proposed a harmonic gated compensation network (HGCN) that predicts the full harmonic locations from the unmasked harmonics and processes the output of a coarse enhancement module to recover the masked harmonics.

HGCN: Harmonic gated compensation network for speech enhancement

1 code implementation • 30 Jan 2022 • Tianrui Wang, Weibin Zhu, Yingying Gao, Junlan Feng, Shilei Zhang

Mask processing in the time-frequency (T-F) domain through the neural network has been one of the mainstreams for single-channel speech enhancement.

Action Detection, Activity Detection, +1

Is Image Encoding Beneficial for Deep Learning in Finance? An Analysis of Image Encoding Methods for the Application of Convolutional Neural Networks in Finance

no code implementations • 17 Oct 2020 • Dan Wang, Tianrui Wang, Ionuţ Florescu

We find that using imaging techniques to input data for CNN works better for financial ratio data but is not significantly better than simply using the 1D input directly for fundamental data.

Retrieval
