Search Results for author: Yangbin Shi

Found 6 papers, 1 papers with code

Alibaba’s Submission for the WMT 2020 APE Shared Task: Improving Automatic Post-Editing with Pre-trained Conditional Cross-Lingual BERT

no code implementations WMT (EMNLP) 2020 Jiayi Wang, Ke Wang, Kai Fan, Yuqi Zhang, Jun Lu, Xin Ge, Yangbin Shi, Yu Zhao

We also apply an imitation learning strategy to augment a reasonable amount of pseudo APE training data, potentially preventing the model to overfit on the limited real training data and boosting the performance on held-out data.

Automatic Post-Editing Benchmarking +4

Alibaba Submission to the WMT20 Parallel Corpus Filtering Task

no code implementations WMT (EMNLP) 2020 Jun Lu, Xin Ge, Yangbin Shi, Yuqi Zhang

In the filtering task, three main methods are applied to evaluate the quality of the parallel corpus, i. e. a) Dual Bilingual GPT-2 model, b) Dual Conditional Cross-Entropy Model and c) IBM word alignment model.

Language Identification Machine Translation +4

Combining Static Word Embeddings and Contextual Representations for Bilingual Lexicon Induction

1 code implementation Findings (ACL) 2021 Jinpeng Zhang, Baijun Ji, Nini Xiao, Xiangyu Duan, Min Zhang, Yangbin Shi, Weihua Luo

Bilingual Lexicon Induction (BLI) aims to map words in one language to their translations in another, and is typically through learning linear projections to align monolingual word representation spaces.

Bilingual Lexicon Induction Word Embeddings

Alibaba Submission to the WMT18 Parallel Corpus Filtering Task

no code implementations WS 2018 Jun Lu, Xiaoyu Lv, Yangbin Shi, Boxing Chen

This paper describes the Alibaba Machine Translation Group submissions to the WMT 2018 Shared Task on Parallel Corpus Filtering.

Machine Translation Sentence +2

Alibaba Submission for WMT18 Quality Estimation Task

no code implementations WS 2018 Jiayi Wang, Kai Fan, Bo Li, Fengming Zhou, Boxing Chen, Yangbin Shi, Luo Si

The goal of WMT 2018 Shared Task on Translation Quality Estimation is to investigate automatic methods for estimating the quality of machine translation results without reference translations.

Automatic Post-Editing Language Modelling +2

Cannot find the paper you are looking for? You can Submit a new open access paper.