Search Results for author: Minghan Wang

Found 37 papers, 2 papers with code

Capture Human Disagreement Distributions by Calibrated Networks for Natural Language Inference

no code implementations • Findings (ACL) 2022 • Yuxia Wang, Minghan Wang, Yimeng Chen, Shimin Tao, Jiaxin Guo, Chang Su, Min Zhang, Hao Yang

Natural Language Inference (NLI) datasets contain examples with highly ambiguous labels due to its subjectivity.

Natural Language Inference

Paper
Add Code

HW-TSC’s Participation in the WMT 2020 News Translation Shared Task

no code implementations • WMT (EMNLP) 2020 • Daimeng Wei, Hengchao Shang, Zhanglin Wu, Zhengzhe Yu, Liangyou Li, Jiaxin Guo, Minghan Wang, Hao Yang, Lizhi Lei, Ying Qin, Shiliang Sun

We also conduct experiment with similar language augmentation, which lead to positive results, although not used in our submission.

Knowledge Distillation Translation

Paper
Add Code

HW-TSC’s Participation at WMT 2020 Automatic Post Editing Shared Task

no code implementations • WMT (EMNLP) 2020 • Hao Yang, Minghan Wang, Daimeng Wei, Hengchao Shang, Jiaxin Guo, Zongyao Li, Lizhi Lei, Ying Qin, Shimin Tao, Shiliang Sun, Yimeng Chen

The paper presents the submission by HW-TSC in the WMT 2020 Automatic Post Editing Shared Task.

Automatic Post-Editing NMT +1

Paper
Add Code

Huawei’s Submissions to the WMT20 Biomedical Translation Task

no code implementations • WMT (EMNLP) 2020 • Wei Peng, Jianfeng Liu, Minghan Wang, Liangyou Li, Xupeng Meng, Hao Yang, Qun Liu

This paper describes Huawei’s submissions to the WMT20 biomedical translation shared task.

Machine Translation Transfer Learning +1

Paper
Add Code

HW-TSC’s Participation in the WMT 2021 News Translation Shared Task

no code implementations • WMT (EMNLP) 2021 • Daimeng Wei, Zongyao Li, Zhanglin Wu, Zhengzhe Yu, Xiaoyu Chen, Hengchao Shang, Jiaxin Guo, Minghan Wang, Lizhi Lei, Min Zhang, Hao Yang, Ying Qin

This paper presents the submission of Huawei Translate Services Center (HW-TSC) to the WMT 2021 News Translation Shared Task.

Knowledge Distillation Translation

Paper
Add Code

HW-TSC’s Participation in the WMT 2021 Triangular MT Shared Task

no code implementations • WMT (EMNLP) 2021 • Zongyao Li, Daimeng Wei, Hengchao Shang, Xiaoyu Chen, Zhanglin Wu, Zhengzhe Yu, Jiaxin Guo, Minghan Wang, Lizhi Lei, Min Zhang, Hao Yang, Ying Qin

This paper presents the submission of Huawei Translation Service Center (HW-TSC) to WMT 2021 Triangular MT Shared Task.

Denoising Translation

Paper
Add Code

HW-TSC’s Participation in the WMT 2021 Large-Scale Multilingual Translation Task

no code implementations • WMT (EMNLP) 2021 • Zhengzhe Yu, Daimeng Wei, Zongyao Li, Hengchao Shang, Xiaoyu Chen, Zhanglin Wu, Jiaxin Guo, Minghan Wang, Lizhi Lei, Min Zhang, Hao Yang, Ying Qin

This paper presents the submission of Huawei Translation Services Center (HW-TSC) to the WMT 2021 Large-Scale Multilingual Translation Task.

Knowledge Distillation Translation

Paper
Add Code

HW-TSC’s Submissions to the WMT21 Biomedical Translation Task

no code implementations • WMT (EMNLP) 2021 • Hao Yang, Zhanglin Wu, Zhengzhe Yu, Xiaoyu Chen, Daimeng Wei, Zongyao Li, Hengchao Shang, Minghan Wang, Jiaxin Guo, Lizhi Lei, Chuanfei Xu, Min Zhang, Ying Qin

This paper describes the submission of Huawei Translation Service Center (HW-TSC) to WMT21 biomedical translation task in two language pairs: Chinese↔English and German↔English (Our registered team name is HuaweiTSC).

Translation

Paper
Add Code

The HW-TSC’s Simultaneous Speech Translation System for IWSLT 2022 Evaluation

no code implementations • IWSLT (ACL) 2022 • Minghan Wang, Jiaxin Guo, Yinglu Li, Xiaosong Qiao, Yuxia Wang, Zongyao Li, Chang Su, Yimeng Chen, Min Zhang, Shimin Tao, Hao Yang, Ying Qin

The cascade system is composed of a chunking-based streaming ASR model and the SimulMT model used in the T2T track.

Chunking Sentence +1

Paper
Add Code

HW-TSC’s Participation in the IWSLT 2022 Isometric Spoken Language Translation

no code implementations • IWSLT (ACL) 2022 • Zongyao Li, Jiaxin Guo, Daimeng Wei, Hengchao Shang, Minghan Wang, Ting Zhu, Zhanglin Wu, Zhengzhe Yu, Xiaoyu Chen, Lizhi Lei, Hao Yang, Ying Qin

This paper presents our submissions to the IWSLT 2022 Isometric Spoken Language Translation task.

Decoder Translation

Paper
Add Code

HW-TSC’s Participation at WMT 2020 Quality Estimation Shared Task

no code implementations • WMT (EMNLP) 2020 • Minghan Wang, Hao Yang, Hengchao Shang, Daimeng Wei, Jiaxin Guo, Lizhi Lei, Ying Qin, Shimin Tao, Shiliang Sun, Yimeng Chen, Liangyou Li

This paper presents our work in the WMT 2020 Word and Sentence-Level Post-Editing Quality Estimation (QE) Shared Task.

Sentence Transfer Learning

Paper
Add Code

How Length Prediction Influence the Performance of Non-Autoregressive Translation?

no code implementations • EMNLP (BlackboxNLP) 2021 • Minghan Wang, Guo Jiaxin, Yuxia Wang, Yimeng Chen, Su Chang, Hengchao Shang, Min Zhang, Shimin Tao, Hao Yang

Length prediction is a special task in a series of NAT models where target length has to be determined before generation.

Language Modelling Translation

Paper
Add Code

HW-TSC at SemEval-2022 Task 3: A Unified Approach Fine-tuned on Multilingual Pretrained Model for PreTENS

no code implementations • SemEval (NAACL) 2022 • Yinglu Li, Min Zhang, Xiaosong Qiao, Minghan Wang

In order to verify whether our model could also perform better in subtask 2 (the regression subtask), the ranking score is transformed into classification labels by an up-sampling strategy.

Binary Classification TAG

Paper
Add Code

HW-TSC at SemEval-2022 Task 7: Ensemble Model Based on Pretrained Models for Identifying Plausible Clarifications

no code implementations • SemEval (NAACL) 2022 • Xiaosong Qiao, Yinglu Li, Min Zhang, Minghan Wang, Hao Yang, Shimin Tao, Qin Ying

This paper describes the system for the identifying Plausible Clarifications of Implicit and Underspecified Phrases.

regression

Paper
Add Code

HI-CMLM: Improve CMLM with Hybrid Decoder Input

no code implementations • INLG (ACL) 2021 • Minghan Wang, Guo Jiaxin, Yuxia Wang, Yimeng Chen, Su Chang, Daimeng Wei, Min Zhang, Shimin Tao, Hao Yang

Mask-predict CMLM (Ghazvininejad et al., 2019) has achieved stunning performance among non-autoregressive NMT models, but we find that the mechanism of predicting all of the target words only depending on the hidden state of [MASK] is not effective and efficient in initial iterations of refinement, resulting in ungrammatical repetitions and slow convergence.

Decoder NMT +1

Paper
Add Code

Make the Blind Translator See The World: A Novel Transfer Learning Solution for Multimodal Machine Translation

no code implementations • MTSummit 2021 • Minghan Wang, Jiaxin Guo, Yimeng Chen, Chang Su, Min Zhang, Shimin Tao, Hao Yang

Based on large-scale pretrained networks and the liability to be easily overfitting with limited labelled training data of multimodal translation (MMT) is a critical issue in MMT.

Multimodal Machine Translation NMT +2

Paper
Add Code

Efficient Transfer Learning for Quality Estimation with Bottleneck Adapter Layer

no code implementations • EAMT 2020 • Hao Yang, Minghan Wang, Ning Xie, Ying Qin, Yao Deng

Compared with the commonly used NuQE baseline, BAL-QE achieves 47% (En-Ru) and 75% (En-De) of performance promotions.

NMT Transfer Learning

Paper
Add Code

Unified Humor Detection Based on Sentence-pair Augmentation and Transfer Learning

no code implementations • EAMT 2020 • Minghan Wang, Hao Yang, Ying Qin, Shiliang Sun, Yao Deng

We propose a unified multilingual model for humor detection which can be trained under a transfer learning framework.

Humor Detection Sentence +2

Paper
Add Code

HW-TSC’s Participation in the WAT 2020 Indic Languages Multilingual Task

no code implementations • AACL (WAT) 2020 • Zhengzhe Yu, Zhanglin Wu, Xiaoyu Chen, Daimeng Wei, Hengchao Shang, Jiaxin Guo, Zongyao Li, Minghan Wang, Liangyou Li, Lizhi Lei, Hao Yang, Ying Qin

This paper describes our work in the WAT 2020 Indic Multilingual Translation Task.

Translation

Paper
Add Code

The HW-TSC’s Offline Speech Translation System for IWSLT 2022 Evaluation

no code implementations • IWSLT (ACL) 2022 • Minghan Wang, Jiaxin Guo, Xiaosong Qiao, Yuxia Wang, Daimeng Wei, Chang Su, Yimeng Chen, Min Zhang, Shimin Tao, Hao Yang, Ying Qin

For machine translation part, we pretrained three translation models on WMT21 dataset and fine-tuned them on in-domain corpora.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +3

Paper
Add Code

The HW-TSC’s Speech to Speech Translation System for IWSLT 2022 Evaluation

no code implementations • IWSLT (ACL) 2022 • Jiaxin Guo, Yinglu Li, Minghan Wang, Xiaosong Qiao, Yuxia Wang, Hengchao Shang, Chang Su, Yimeng Chen, Min Zhang, Shimin Tao, Hao Yang, Ying Qin

The paper presents the HW-TSC’s pipeline and results of Offline Speech to Speech Translation for IWSLT 2022.

Machine Translation Speech-to-Speech Translation +1

Paper
Add Code

OpenFactCheck: A Unified Framework for Factuality Evaluation of LLMs

1 code implementation • 9 May 2024 • Yuxia Wang, Minghan Wang, Hasan Iqbal, Georgi Georgiev, Jiahui Geng, Preslav Nakov

The increased use of large language models (LLMs) across a variety of real-world applications calls for mechanisms to verify the factual accuracy of their outputs.

Paper
Code

Against The Achilles' Heel: A Survey on Red Teaming for Generative Models

no code implementations • 31 Mar 2024 • Lizhi Lin, Honglin Mu, Zenan Zhai, Minghan Wang, Yuxia Wang, Renxi Wang, Junjie Gao, Yixuan Zhang, Wanxiang Che, Timothy Baldwin, Xudong Han, Haonan Li

Generative models are rapidly gaining popularity and being integrated into everyday applications, raising concerns over their safety issues as various vulnerabilities are exposed.

Paper
Add Code

Conversational SimulMT: Efficient Simultaneous Translation with Large Language Models

no code implementations • 16 Feb 2024 • Minghan Wang, Thuy-Trang Vu, Ehsan Shareghi, Gholamreza Haffari

Simultaneous machine translation (SimulMT) presents a challenging trade-off between translation quality and latency.

Machine Translation Translation

Paper
Add Code

Factuality of Large Language Models in the Year 2024

no code implementations • 4 Feb 2024 • Yuxia Wang, Minghan Wang, Muhammad Arslan Manzoor, Fei Liu, Georgi Georgiev, Rocktim Jyoti Das, Preslav Nakov

Large language models (LLMs), especially when instruction-tuned for chat, have become part of our daily lives, freeing people from the process of searching, extracting, and integrating information from multiple sources by offering a straightforward answer to a variety of questions in a single place.

Text Generation

Paper
Add Code

UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction

no code implementations • 11 Jan 2024 • Jiaxin Guo, Minghan Wang, Xiaosong Qiao, Daimeng Wei, Hengchao Shang, Zongyao Li, Zhengzhe Yu, Yinglu Li, Chang Su, Min Zhang, Shimin Tao, Hao Yang

Previous works usually adopt end-to-end models and has strong dependency on Pseudo Paired Data and Original Paired Data.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +1

Paper
Add Code

INarIG: Iterative Non-autoregressive Instruct Generation Model For Word-Level Auto Completion

no code implementations • 30 Nov 2023 • Hengchao Shang, Zongyao Li, Daimeng Wei, Jiaxin Guo, Minghan Wang, Xiaoyu Chen, Lizhi Lei, Hao Yang

WLAC predicts a target word given a source sentence, translation context, and a human typed character sequence.

Machine Translation Sentence +1

Paper
Add Code

Rethinking STS and NLI in Large Language Models

no code implementations • 16 Sep 2023 • Yuxia Wang, Minghan Wang, Preslav Nakov

Recent years have seen the rise of large language models (LLMs), where practitioners use task-specific prompts; this was shown to be effective for a variety of tasks.

Natural Language Inference Semantic Textual Similarity +1

Paper
Add Code

Simultaneous Machine Translation with Large Language Models

no code implementations • 13 Sep 2023 • Minghan Wang, Jinming Zhao, Thuy-Trang Vu, Fatemeh Shiri, Ehsan Shareghi, Gholamreza Haffari

The results show that LLM outperforms dedicated MT models in terms of BLEU and LAAL metrics.

Machine Translation Translation

Paper
Add Code

Knowledge-Prompted Estimator: A Novel Approach to Explainable Machine Translation Assessment

no code implementations • 13 Jun 2023 • Hao Yang, Min Zhang, Shimin Tao, Minghan Wang, Daimeng Wei, Yanfei Jiang

Cross-lingual Machine Translation (MT) quality estimation plays a crucial role in evaluating translation performance.

Machine Translation Sentence +1

Paper
Add Code

Text Style Transfer Back-Translation

1 code implementation • 2 Jun 2023 • Daimeng Wei, Zhanglin Wu, Hengchao Shang, Zongyao Li, Minghan Wang, Jiaxin Guo, Xiaoyu Chen, Zhengzhe Yu, Hao Yang

To address this issue, we propose Text Style Transfer Back Translation (TST BT), which uses a style transfer model to modify the source side of BT data.

Data Augmentation Domain Adaptation +4

Paper
Code

Diformer: Directional Transformer for Neural Machine Translation

no code implementations • EAMT 2022 • Minghan Wang, Jiaxin Guo, Yuxia Wang, Daimeng Wei, Hengchao Shang, Chang Su, Yimeng Chen, Yinglu Li, Min Zhang, Shimin Tao, Hao Yang

In this paper, we aim to close the gap by preserving the original objective of AR and NAR under a unified framework.

Language Modelling Machine Translation +1

Paper
Add Code

Joint-training on Symbiosis Networks for Deep Nueral Machine Translation models

no code implementations • 22 Dec 2021 • Zhengzhe Yu, Jiaxin Guo, Minghan Wang, Daimeng Wei, Hengchao Shang, Zongyao Li, Zhanglin Wu, Yuxia Wang, Yimeng Chen, Chang Su, Min Zhang, Lizhi Lei, Shimin Tao, Hao Yang

Deep encoders have been proven to be effective in improving neural machine translation (NMT) systems, but it reaches the upper bound of translation quality when the number of encoder layers exceeds 18.

Machine Translation NMT +1

Paper
Add Code

Self-Distillation Mixup Training for Non-autoregressive Neural Machine Translation

no code implementations • 22 Dec 2021 • Jiaxin Guo, Minghan Wang, Daimeng Wei, Hengchao Shang, Yuxia Wang, Zongyao Li, Zhengzhe Yu, Zhanglin Wu, Yimeng Chen, Chang Su, Min Zhang, Lizhi Lei, Shimin Tao, Hao Yang

An effective training strategy to improve the performance of AT models is Self-Distillation Mixup (SDM) Training, which pre-trains a model on raw data, generates distilled data by the pre-trained model itself and finally re-trains a model on the combination of raw data and distilled data.

Knowledge Distillation Machine Translation +1

Paper
Add Code

The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation

no code implementations • 9 Aug 2021 • Minghan Wang, Yuxia Wang, Chang Su, Jiaxin Guo, Yingtao Zhang, Yujia Liu, Min Zhang, Shimin Tao, Xingshan Zeng, Liangyou Li, Hao Yang, Ying Qin

This paper describes our work in participation of the IWSLT-2021 offline speech translation task.

Automatic Speech Recognition Automatic Speech Recognition (ASR) +5

Paper
Add Code

The HW-TSC Video Speech Translation System at IWSLT 2020

no code implementations • WS 2020 • Minghan Wang, Hao Yang, Yao Deng, Ying Qin, Lizhi Lei, Daimeng Wei, Hengchao Shang, Ning Xie, Xiaochun Li, Jiaxian Guo

The paper presents details of our system in the IWSLT Video Speech Translation evaluation.

NMT Translation

Paper
Add Code

UniMelb at SemEval-2019 Task 12: Multi-model combination for toponym resolution

no code implementations • SEMEVAL 2019 • Haonan Li, Minghan Wang, Timothy Baldwin, Martin Tomko, Maria Vasardani

This paper describes our submission to SemEval-2019 Task 12 on toponym resolution over scientific articles.

NER Toponym Resolution

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.