Search Results for author: Renjie Zheng

Found 23 papers, 4 papers with code

ERNIE-SAT: Speech and Text Joint Pretraining for Cross-Lingual Multi-Speaker Text-to-Speech

2 code implementations7 Nov 2022 Xiaoran Fan, Chao Pang, Tian Yuan, He Bai, Renjie Zheng, Pengfei Zhu, Shuohuan Wang, Junkun Chen, Zeyu Chen, Liang Huang, Yu Sun, Hua Wu

In this paper, we extend the pretraining method for cross-lingual multi-speaker speech synthesis tasks, including cross-lingual multi-speaker voice cloning and cross-lingual multi-speaker speech editing.

Representation Learning Speech Synthesis +2

Data-Driven Adaptive Simultaneous Machine Translation

no code implementations27 Apr 2022 Guangxu Xun, Mingbo Ma, Yuchen Bian, Xingyu Cai, Jiaji Huang, Renjie Zheng, Junkun Chen, Jiahong Yuan, Kenneth Church, Liang Huang

In simultaneous translation (SimulMT), the most widely used strategy is the wait-k policy thanks to its simplicity and effectiveness in balancing translation quality and latency.

Machine Translation Sentence +1

A$^3$T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing

2 code implementations18 Mar 2022 He Bai, Renjie Zheng, Junkun Chen, Xintong Li, Mingbo Ma, Liang Huang

Recently, speech representation learning has improved many speech-related tasks such as speech recognition, speech classification, and speech-to-text translation.

Representation Learning Speaker Verification +5

The Role of Phonetic Units in Speech Emotion Recognition

no code implementations2 Aug 2021 Jiahong Yuan, Xingyu Cai, Renjie Zheng, Liang Huang, Kenneth Church

Models of phonemes, broad phonetic classes, and syllables all significantly outperform the utterance model, demonstrating that phonetic units are helpful and should be incorporated in speech emotion recognition.

Speech Emotion Recognition speech-recognition +1

MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation

no code implementations22 Oct 2020 Junkun Chen, Mingbo Ma, Renjie Zheng, Liang Huang

End-to-end Speech-to-text Translation (E2E-ST), which directly translates source language speech to target language text, is widely useful in practice, but traditional cascaded approaches (ASR+MT) often suffer from error propagation in the pipeline.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +4

Improving Simultaneous Translation by Incorporating Pseudo-References with Fewer Reorderings

no code implementations EMNLP 2021 Junkun Chen, Renjie Zheng, Atsuhito Kita, Mingbo Ma, Liang Huang

Simultaneous translation is vastly different from full-sentence translation, in the sense that it starts translation before the source sentence ends, with only a few words delay.

Sentence Translation

Fluent and Low-latency Simultaneous Speech-to-Speech Translation with Self-adaptive Training

no code implementations Findings of the Association for Computational Linguistics 2020 Renjie Zheng, Mingbo Ma, Baigong Zheng, Kaibo Liu, Jiahong Yuan, Kenneth Church, Liang Huang

Simultaneous speech-to-speech translation is widely useful but extremely challenging, since it needs to generate target-language speech concurrently with the source-language speech, with only a few seconds delay.

Sentence Speech-to-Speech Translation +1

Opportunistic Decoding with Timely Correction for Simultaneous Translation

no code implementations ACL 2020 Renjie Zheng, Mingbo Ma, Baigong Zheng, Kaibo Liu, Liang Huang

Simultaneous translation has many important application scenarios and attracts much attention from both academia and industry recently.

Translation

Simultaneous Translation Policies: From Fixed to Adaptive

no code implementations ACL 2020 Baigong Zheng, Kaibo Liu, Renjie Zheng, Mingbo Ma, Hairong Liu, Liang Huang

Adaptive policies are better than fixed policies for simultaneous translation, since they can flexibly balance the tradeoff between translation quality and latency based on the current context information.

Sentence Translation

Speculative Beam Search for Simultaneous Translation

no code implementations IJCNLP 2019 Renjie Zheng, Mingbo Ma, Baigong Zheng, Liang Huang

Beam search is universally used in full-sentence translation but its application to simultaneous translation remains non-trivial, where output words are committed on the fly.

Language Modelling Sentence +1

Ensemble Sequence Level Training for Multimodal MT: OSU-Baidu WMT18 Multimodal Machine Translation System Report

no code implementations WS 2018 Renjie Zheng, Yilin Yang, Mingbo Ma, Liang Huang

This paper describes multimodal machine translation systems developed jointly by Oregon State University and Baidu Research for WMT 2018 Shared Task on multimodal translation.

Multimodal Machine Translation reinforcement-learning +2

Multi-Reference Training with Pseudo-References for Neural Translation and Text Generation

no code implementations EMNLP 2018 Renjie Zheng, Mingbo Ma, Liang Huang

Neural text generation, including neural machine translation, image captioning, and summarization, has been quite successful recently.

Image Captioning Machine Translation +2

Same Representation, Different Attentions: Shareable Sentence Representation Learning from Multiple Tasks

no code implementations22 Apr 2018 Renjie Zheng, Junkun Chen, Xipeng Qiu

More specifically, all tasks share the same sentence representation and each task can select the task-specific information from the shared sentence representation with attention mechanism.

General Classification Multi-Task Learning +4

Cannot find the paper you are looking for? You can Submit a new open access paper.