Search Results for author: Shanbo Cheng

Found 22 papers, 4 papers with code

MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation

no code implementations • 14 Mar 2024 • Jiahuan Li, Shanbo Cheng, ShuJian Huang, Jiajun Chen

Large Language Models (LLM) have demonstrated their strong ability in the field of machine translation (MT), yet they suffer from high computational cost and latency.

Knowledge Distillation Machine Translation +1

Paper
Add Code

Speech Translation with Large Language Models: An Industrial Practice

no code implementations • 21 Dec 2023 • Zhichao Huang, Rong Ye, Tom Ko, Qianqian Dong, Shanbo Cheng, Mingxuan Wang, Hang Li

Given the great success of large language models (LLMs) across various tasks, in this paper, we introduce LLM-ST, a novel and effective speech translation model constructed upon a pre-trained LLM.

Language Modelling Large Language Model +1

Paper
Add Code

Only 5\% Attention Is All You Need: Efficient Long-range Document-level Neural Machine Translation

no code implementations • 25 Sep 2023 • Zihan Liu, Zewei Sun, Shanbo Cheng, ShuJian Huang, Mingxuan Wang

Document-level Neural Machine Translation (DocNMT) has been proven crucial for handling discourse phenomena by introducing document-level context information.

Dimensionality Reduction Machine Translation +1

Paper
Add Code

Eliciting the Translation Ability of Large Language Models via Multilingual Finetuning with Translation Instructions

no code implementations • 24 May 2023 • Jiahuan Li, Hao Zhou, ShuJian Huang, Shanbo Cheng, Jiajun Chen

Secondly, we find that LLMs' ability to carry out translation instructions relies on the understanding of translation instructions and the alignment among different languages.

Language Modelling Translation

Paper
Add Code

BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation

1 code implementation • 23 May 2023 • Liyan Kang, Luyang Huang, Ningxin Peng, Peihao Zhu, Zewei Sun, Shanbo Cheng, Mingxuan Wang, Degen Huang, Jinsong Su

We also introduce two deliberately designed test sets to verify the necessity of visual information: Ambiguous with the presence of ambiguous words, and Unambiguous in which the text context is self-contained for translation.

Contrastive Learning Multimodal Machine Translation +3

Paper
Code

Visual Information Matters for ASR Error Correction

no code implementations • 16 Mar 2023 • Vanya Bannihatti Kumar, Shanbo Cheng, Ningxin Peng, Yuchen Zhang

Aiming to improve the Automatic Speech Recognition (ASR) outputs with a post-processing step, ASR error correction (EC) techniques have been widely developed due to their efficiency in using parallel text data.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Add Code

Beyond Triplet: Leveraging the Most Data for Multimodal Machine Translation

no code implementations • 20 Dec 2022 • Yaoming Zhu, Zewei Sun, Shanbo Cheng, Luyang Huang, Liwei Wu, Mingxuan Wang

Therefore, this paper correspondingly establishes new methods and new datasets for MMT.

Multimodal Machine Translation Translation

Paper
Add Code

Better Datastore, Better Translation: Generating Datastores from Pre-Trained Models for Nearest Neural Machine Translation

no code implementations • 17 Dec 2022 • Jiahuan Li, Shanbo Cheng, Zewei Sun, Mingxuan Wang, ShuJian Huang

The effectiveness of kNNMT directly depends on the quality of retrieved neighbors.

Machine Translation NMT +2

Paper
Add Code

Controlling Styles in Neural Machine Translation with Activation Prompt

1 code implementation • 17 Dec 2022 • Yifan Wang, Zewei Sun, Shanbo Cheng, Weiguo Zheng, Mingxuan Wang

Controlling styles in neural machine translation (NMT) has attracted wide attention, as it is crucial for enhancing user experience.

Machine Translation NMT +1

Paper
Code

Zero-shot Domain Adaptation for Neural Machine Translation with Retrieved Phrase-level Prompts

no code implementations • 23 Sep 2022 • Zewei Sun, Qingnan Jiang, ShuJian Huang, Jun Cao, Shanbo Cheng, Mingxuan Wang

Domain adaptation is an important challenge for neural machine translation.

Domain Adaptation Machine Translation +1

Paper
Add Code

Unified Multimodal Punctuation Restoration Framework for Mixed-Modality Corpus

1 code implementation • 24 Jan 2022 • Yaoming Zhu, Liwei Wu, Shanbo Cheng, Mingxuan Wang

The punctuation restoration task aims to correctly punctuate the output transcriptions of automatic speech recognition systems.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +2

Paper
Code

Non-Autoregressive Models are Better Multilingual Translators

no code implementations • ICLR 2022 • Zhenqiao Song, Hao Zhou, Lihua Qian, Jingjing Xu, Shanbo Cheng, Mingxuan Wang, Lei LI

Multilingual machine translation aims to develop a single model for multiple language directions.

Machine Translation Sentence +1

Paper
Add Code

The Volctrans GLAT System: Non-autoregressive Translation Meets WMT21

no code implementations • WMT (EMNLP) 2021 • Lihua Qian, Yi Zhou, Zaixiang Zheng, Yaoming Zhu, Zehui Lin, Jiangtao Feng, Shanbo Cheng, Lei LI, Mingxuan Wang, Hao Zhou

This paper describes the Volctrans' submission to the WMT21 news translation shared task for German->English translation.

Translation

Paper
Add Code

Learning Kernel-Smoothed Machine Translation with Retrieved Examples

2 code implementations • EMNLP 2021 • Qingnan Jiang, Mingxuan Wang, Jun Cao, Shanbo Cheng, ShuJian Huang, Lei LI

How to effectively adapt neural machine translation (NMT) models according to emerging cases without retraining?

Domain Adaptation Machine Translation +3

Paper
Code

Language Tags Matter for Zero-Shot Neural Machine Translation

no code implementations • Findings (ACL) 2021 • Liwei Wu, Shanbo Cheng, Mingxuan Wang, Lei LI

Language tag (LT) strategies are often adopted to indicate the translation directions in MNMT.

Machine Translation TAG +1

Paper
Add Code

Language-aware Interlingua for Multilingual Neural Machine Translation

no code implementations • ACL 2020 • Changfeng Zhu, Heng Yu, Shanbo Cheng, Weihua Luo

However, the traditional multilingual model fails to capture the diversity and specificity of different languages, resulting in inferior performance compared with individual models that are sufficiently trained.

Machine Translation NMT +2

Paper
Add Code

AR: Auto-Repair the Synthetic Data for Neural Machine Translation

no code implementations • 5 Apr 2020 • Shanbo Cheng, Shaohui Kuang, Rongxiang Weng, Heng Yu, Changfeng Zhu, Weihua Luo

Compared with only using limited authentic parallel data as training corpus, many studies have proved that incorporating synthetic parallel data, which generated by back translation (BT) or forward translation (FT, or selftraining), into the NMT training process can significantly improve translation quality.

Machine Translation NMT +2

Paper
Add Code

Acquiring Knowledge from Pre-trained Model to Neural Machine Translation

no code implementations • 4 Dec 2019 • Rongxiang Weng, Heng Yu, Shu-Jian Huang, Shanbo Cheng, Weihua Luo

The standard paradigm of exploiting them includes two steps: first, pre-training a model, e. g. BERT, with a large scale unlabeled monolingual data.

General Knowledge Knowledge Distillation +3

Paper
Add Code

Improving Multilingual Semantic Textual Similarity with Shared Sentence Encoder for Low-resource Languages

no code implementations • 20 Oct 2018 • Xin Tang, Shanbo Cheng, Loc Do, Zhiyu Min, Feng Ji, Heng Yu, Ji Zhang, Haiqin Chen

Our approach is extended from a basic monolingual STS framework to a shared multilingual encoder pretrained with translation task to incorporate rich-resource language data.

Machine Translation Semantic Similarity +4