Search Results for author: Wubo Li

Found 7 papers, 4 papers with code

DiDiSpeech: A Large Scale Mandarin Speech Corpus

no code implementations19 Oct 2020 Tingwei Guo, Cheng Wen, Dongwei Jiang, Ne Luo, Ruixiong Zhang, Shuaijiang Zhao, Wubo Li, Cheng Gong, Wei Zou, Kun Han, Xiangang Li

This paper introduces a new open-sourced Mandarin speech corpus, called DiDiSpeech.

Audio and Speech Processing

A Further Study of Unsupervised Pre-training for Transformer Based Speech Recognition

1 code implementation20 May 2020 Dongwei Jiang, Wubo Li, Ruixiong Zhang, Miao Cao, Ne Luo, Yang Han, Wei Zou, Xiangang Li

In this paper, we conduct a further study on MPC and focus on three important aspects: the effect of pre-training data speaking style, its extension on streaming model, and how to better transfer learned knowledge from pre-training stage to downstream tasks.

speech-recognition Speech Recognition +2

TCT: A Cross-supervised Learning Method for Multimodal Sequence Representation

no code implementations23 Oct 2019 Wubo Li, Wei Zou, Xiangang Li

Multimodalities provide promising performance than unimodality in most tasks.

A Multi-Modal Chinese Poetry Generation Model

1 code implementation26 Jun 2018 Dayiheng Liu, Quan Guo, Wubo Li, Jiancheng Lv

Given a picture, the first line, the title and the other lines of the poem are successively generated in three stages.


Cannot find the paper you are looking for? You can Submit a new open access paper.