Search Results for author: Zhendong Peng

Found 10 papers, 4 papers with code

Radar Rainbow Beams For Wideband mmWave Communication: Beam Training And Tracking

no code implementations • 14 Mar 2024 • Gui Zhou, Moritz Garkisch, Zhendong Peng, Cunhua Pan, Robert Schober

Three detection and estimation schemes are proposed based on radar rainbow beams for estimation of the users' angles, distances, and velocities, which are then exploited for communication beamformer design.

Paper
Add Code

ZeroPrompt: Streaming Acoustic Encoders are Zero-Shot Masked LMs

no code implementations • 18 May 2023 • Xingchen Song, Di wu, BinBin Zhang, Zhendong Peng, Bo Dang, Fuping Pan, Zhiyong Wu

In this paper, we present ZeroPrompt (Figure 1-(a)) and the corresponding Prompt-and-Refine strategy (Figure 3), two simple but effective \textbf{training-free} methods to decrease the Token Display Time (TDT) of streaming ASR models \textbf{without any accuracy loss}.

Paper
Add Code

TrimTail: Low-Latency Streaming ASR with Simple but Effective Spectrogram-Level Length Penalty

1 code implementation • 1 Nov 2022 • Xingchen Song, Di wu, Zhiyong Wu, BinBin Zhang, Yuekai Zhang, Zhendong Peng, Wenpeng Li, Fuping Pan, Changbao Zhu

In this paper, we present TrimTail, a simple but effective emission regularization method to improve the latency of streaming ASR models.

3,687

Paper
Code

FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition

no code implementations • 31 Oct 2022 • Xingchen Song, Di wu, BinBin Zhang, Zhiyong Wu, Wenpeng Li, Dongfang Li, Pengshen Zhang, Zhendong Peng, Fuping Pan, Changbao Zhu, Zhongqin Wu

Therefore, we name it FusionFormer.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

Error Propagation and Overhead Reduced Channel Estimation for RIS-Aided Multi-User mmWave Systems

no code implementations • 23 Aug 2022 • Zhendong Peng, Cunhua Pan, Gui Zhou, Hong Ren

In this paper, we propose a novel two-stage based uplink channel estimation strategy with reduced pilot overhead and error propagation for a reconfigurable intelligent surface (RIS)-aided multi-user (MU) millimeter wave (mmWave) system.

Paper
Add Code

Channel Estimation for RIS-Aided Multi-User mmWave Systems with Uniform Planar Arrays

no code implementations • 15 Aug 2022 • Zhendong Peng, Gui Zhou, Cunhua Pan, Hong Ren, A. Lee Swindlehurst, Petar Popovski, Gang Wu

Specifically, in Stage I, the channel state information (CSI) of a typical user is estimated.

Paper
Add Code

WeNet 2.0: More Productive End-to-End Speech Recognition Toolkit

3 code implementations • 29 Mar 2022 • BinBin Zhang, Di wu, Zhendong Peng, Xingchen Song, Zhuoyuan Yao, Hang Lv, Lei Xie, Chao Yang, Fuping Pan, Jianwei Niu

Recently, we made available WeNet, a production-oriented end-to-end speech recognition toolkit, which introduces a unified two-pass (U2) framework and a built-in runtime to address the streaming and non-streaming decoding modes in a single model.

Language Modelling speech-recognition +1

3,687

Paper
Code

WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition

1 code implementation • 7 Oct 2021 • BinBin Zhang, Hang Lv, Pengcheng Guo, Qijie Shao, Chao Yang, Lei Xie, Xin Xu, Hui Bu, Xiaoyu Chen, Chenchen Zeng, Di wu, Zhendong Peng

In this paper, we present WenetSpeech, a multi-domain Mandarin corpus consisting of 10000+ hours high-quality labeled speech, 2400+ hours weakly labeled speech, and about 10000 hours unlabeled speech, with 22400+ hours in total.

Ranked #5 on Speech Recognition on WenetSpeech

Label Error Detection Optical Character Recognition +4

454

Paper
Code

U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition

no code implementations • 10 Jun 2021 • Di wu, BinBin Zhang, Chao Yang, Zhendong Peng, Wenjing Xia, Xiaoyu Chen, Xin Lei

On the experiment of AISHELL-1, we achieve a 4. 63\% character error rate (CER) with a non-streaming setup and 5. 05\% with a streaming setup with 320ms latency by U2++.

Data Augmentation speech-recognition +1

Paper
Add Code

WeNet: Production oriented Streaming and Non-streaming End-to-End Speech Recognition Toolkit

4 code implementations • 2 Feb 2021 • Zhuoyuan Yao, Di wu, Xiong Wang, BinBin Zhang, Fan Yu, Chao Yang, Zhendong Peng, Xiaoyu Chen, Lei Xie, Xin Lei

In this paper, we propose an open source, production first, and production ready speech recognition toolkit called WeNet in which a new two-pass approach is implemented to unify streaming and non-streaming end-to-end (E2E) speech recognition in a single model.

speech-recognition Speech Recognition

10,131

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.