Search Results for author: Yimeng Chen

Found 18 papers, 1 papers with code

HW-TSC’s Participation at WMT 2021 Quality Estimation Shared Task

no code implementations • WMT (EMNLP) 2021 • Yimeng Chen, Chang Su, Yingtao Zhang, Yuxia Wang, Xiang Geng, Hao Yang, Shimin Tao, Guo Jiaxin, Wang Minghan, Min Zhang, Yujia Liu, ShuJian Huang

This paper presents our work in WMT 2021 Quality Estimation (QE) Shared Task.

Data Augmentation Sentence +1

Paper
Add Code

The HW-TSC’s Simultaneous Speech Translation System for IWSLT 2022 Evaluation

no code implementations • IWSLT (ACL) 2022 • Minghan Wang, Jiaxin Guo, Yinglu Li, Xiaosong Qiao, Yuxia Wang, Zongyao Li, Chang Su, Yimeng Chen, Min Zhang, Shimin Tao, Hao Yang, Ying Qin

The cascade system is composed of a chunking-based streaming ASR model and the SimulMT model used in the T2T track.

Chunking Sentence +1

Paper
Add Code

HW-TSC’s Participation at WMT 2020 Quality Estimation Shared Task

no code implementations • WMT (EMNLP) 2020 • Minghan Wang, Hao Yang, Hengchao Shang, Daimeng Wei, Jiaxin Guo, Lizhi Lei, Ying Qin, Shimin Tao, Shiliang Sun, Yimeng Chen, Liangyou Li

This paper presents our work in the WMT 2020 Word and Sentence-Level Post-Editing Quality Estimation (QE) Shared Task.

Sentence Transfer Learning

Paper
Add Code

How Length Prediction Influence the Performance of Non-Autoregressive Translation?

no code implementations • EMNLP (BlackboxNLP) 2021 • Minghan Wang, Guo Jiaxin, Yuxia Wang, Yimeng Chen, Su Chang, Hengchao Shang, Min Zhang, Shimin Tao, Hao Yang

Length prediction is a special task in a series of NAT models where target length has to be determined before generation.

Language Modelling Translation

Paper
Add Code

The HW-TSC’s Offline Speech Translation System for IWSLT 2022 Evaluation

no code implementations • IWSLT (ACL) 2022 • Minghan Wang, Jiaxin Guo, Xiaosong Qiao, Yuxia Wang, Daimeng Wei, Chang Su, Yimeng Chen, Min Zhang, Shimin Tao, Hao Yang, Ying Qin

For machine translation part, we pretrained three translation models on WMT21 dataset and fine-tuned them on in-domain corpora.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

The HW-TSC’s Speech to Speech Translation System for IWSLT 2022 Evaluation

no code implementations • IWSLT (ACL) 2022 • Jiaxin Guo, Yinglu Li, Minghan Wang, Xiaosong Qiao, Yuxia Wang, Hengchao Shang, Chang Su, Yimeng Chen, Min Zhang, Shimin Tao, Hao Yang, Ying Qin

The paper presents the HW-TSC’s pipeline and results of Offline Speech to Speech Translation for IWSLT 2022.

Machine Translation Speech-to-Speech Translation +1

Paper
Add Code

Capture Human Disagreement Distributions by Calibrated Networks for Natural Language Inference

no code implementations • Findings (ACL) 2022 • Yuxia Wang, Minghan Wang, Yimeng Chen, Shimin Tao, Jiaxin Guo, Chang Su, Min Zhang, Hao Yang

Natural Language Inference (NLI) datasets contain examples with highly ambiguous labels due to its subjectivity.

Natural Language Inference

Paper
Add Code

Make the Blind Translator See The World: A Novel Transfer Learning Solution for Multimodal Machine Translation

no code implementations • MTSummit 2021 • Minghan Wang, Jiaxin Guo, Yimeng Chen, Chang Su, Min Zhang, Shimin Tao, Hao Yang

Based on large-scale pretrained networks and the liability to be easily overfitting with limited labelled training data of multimodal translation (MMT) is a critical issue in MMT.

Multimodal Machine Translation NMT +2

Paper
Add Code

HW-TSC’s Participation at WMT 2020 Automatic Post Editing Shared Task

no code implementations • WMT (EMNLP) 2020 • Hao Yang, Minghan Wang, Daimeng Wei, Hengchao Shang, Jiaxin Guo, Zongyao Li, Lizhi Lei, Ying Qin, Shimin Tao, Shiliang Sun, Yimeng Chen

The paper presents the submission by HW-TSC in the WMT 2020 Automatic Post Editing Shared Task.

Automatic Post-Editing NMT +1

Paper
Add Code

HI-CMLM: Improve CMLM with Hybrid Decoder Input

no code implementations • INLG (ACL) 2021 • Minghan Wang, Guo Jiaxin, Yuxia Wang, Yimeng Chen, Su Chang, Daimeng Wei, Min Zhang, Shimin Tao, Hao Yang

Mask-predict CMLM (Ghazvininejad et al., 2019) has achieved stunning performance among non-autoregressive NMT models, but we find that the mechanism of predicting all of the target words only depending on the hidden state of [MASK] is not effective and efficient in initial iterations of refinement, resulting in ungrammatical repetitions and slow convergence.

NMT Translation

Paper
Add Code

From Handcrafted Features to LLMs: A Brief Survey for Machine Translation Quality Estimation

no code implementations • 21 Mar 2024 • Haofei Zhao, Yilun Liu, Shimin Tao, Weibin Meng, Yimeng Chen, Xiang Geng, Chang Su, Min Zhang, Hao Yang

Machine Translation Quality Estimation (MTQE) is the task of estimating the quality of machine-translated text in real time without the need for reference translations, which is of great importance for the development of MT.

Machine Translation Sentence

Paper
Add Code

Explore and Exploit the Diverse Knowledge in Model Zoo for Domain Generalization

no code implementations • 5 Jun 2023 • Yimeng Chen, Tianyang Hu, Fengwei Zhou, Zhenguo Li, ZhiMing Ma

The proliferation of pretrained models, as a result of advancements in pretraining techniques, has led to the emergence of a vast zoo of publicly available models.

Domain Generalization Out-of-Distribution Generalization

Paper
Add Code

PEMP: Leveraging Physics Properties to Enhance Molecular Property Prediction

no code implementations • 18 Oct 2022 • Yuancheng Sun, Yimeng Chen, Weizhi Ma, Wenhao Huang, Kang Liu, ZhiMing Ma, Wei-Ying Ma, Yanyan Lan

In our implementation, we adopt both the state-of-the-art molecule embedding models under the supervised learning paradigm and the pretraining paradigm as the molecule representation module of PEMP, respectively.

Drug Discovery Molecular Property Prediction +2

Paper
Add Code

When Does Group Invariant Learning Survive Spurious Correlations?

1 code implementation • 29 Jun 2022 • Yimeng Chen, Ruibin Xiong, ZhiMing Ma, Yanyan Lan

Motivated by this, we design a new group invariant learning method, which constructs groups with statistical independence tests, and reweights samples by group label proportion to meet the criteria.

Out-of-Distribution Generalization

Paper
Code

Self-Distillation Mixup Training for Non-autoregressive Neural Machine Translation

no code implementations • 22 Dec 2021 • Jiaxin Guo, Minghan Wang, Daimeng Wei, Hengchao Shang, Yuxia Wang, Zongyao Li, Zhengzhe Yu, Zhanglin Wu, Yimeng Chen, Chang Su, Min Zhang, Lizhi Lei, Shimin Tao, Hao Yang

An effective training strategy to improve the performance of AT models is Self-Distillation Mixup (SDM) Training, which pre-trains a model on raw data, generates distilled data by the pre-trained model itself and finally re-trains a model on the combination of raw data and distilled data.

Knowledge Distillation Machine Translation +1

Paper
Add Code

Diformer: Directional Transformer for Neural Machine Translation

no code implementations • EAMT 2022 • Minghan Wang, Jiaxin Guo, Yuxia Wang, Daimeng Wei, Hengchao Shang, Chang Su, Yimeng Chen, Yinglu Li, Min Zhang, Shimin Tao, Hao Yang

In this paper, we aim to close the gap by preserving the original objective of AR and NAR under a unified framework.

Language Modelling Machine Translation +1

Paper
Add Code

Joint-training on Symbiosis Networks for Deep Nueral Machine Translation models

no code implementations • 22 Dec 2021 • Zhengzhe Yu, Jiaxin Guo, Minghan Wang, Daimeng Wei, Hengchao Shang, Zongyao Li, Zhanglin Wu, Yuxia Wang, Yimeng Chen, Chang Su, Min Zhang, Lizhi Lei, Shimin Tao, Hao Yang

Deep encoders have been proven to be effective in improving neural machine translation (NMT) systems, but it reaches the upper bound of translation quality when the number of encoder layers exceeds 18.

Machine Translation NMT +1

Paper
Add Code

Uncertainty Calibration for Ensemble-Based Debiasing Methods

no code implementations • NeurIPS 2021 • Ruibin Xiong, Yimeng Chen, Liang Pang, Xueqi Chen, Yanyan Lan

Ensemble-based debiasing methods have been shown effective in mitigating the reliance of classifiers on specific dataset bias, by exploiting the output of a bias-only model to adjust the learning target.

Fact Verification

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.