Search Results for author: Jianhao Yan

Found 18 papers, 10 papers with code

RefuteBench: Evaluating Refuting Instruction-Following for Large Language Models

1 code implementation • 21 Feb 2024 • Jianhao Yan, Yun Luo, Yue Zhang

The application scope of large language models (LLMs) is increasingly expanding.

Instruction Following Machine Translation +1

Potential and Challenges of Model Editing for Social Debiasing

no code implementations • 21 Feb 2024 • Jianhao Yan, Futing Wang, Yafu Li, Yue Zhang

Large language models (LLMs) trained on vast corpora suffer from inevitable stereotype biases.

Model Editing

Dynamics of Instruction Tuning: Each Ability of Large Language Models Has Its Own Growth Pace

1 code implementation • 30 Oct 2023 • Chiyu Song, Zhanchao Zhou, Jianhao Yan, Yuejiao Fei, Zhenzhong Lan, Yue Zhang

Instruction tuning is a burgeoning method to elicit the general intelligence of Large Language Models (LLMs).

Code Generation Logical Reasoning

Understanding In-Context Learning from Repetitions

1 code implementation • 30 Sep 2023 • Jianhao Yan, Jin Xu, Chiyu Song, Chenming Wu, Yafu Li, Yue Zhang

This paper explores the elusive mechanism underpinning in-context learning in Large Language Models (LLMs).

In-Context Learning Text Generation

Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation

1 code implementation • 8 Jul 2023 • Yulong Chen, Huajian Zhang, Yijie Zhou, Xuefeng Bai, Yueguan Wang, Ming Zhong, Jianhao Yan, Yafu Li, Judy Li, Michael Zhu, Yue Zhang

Additionally, based on the same intuition, we propose a 2-Step method, which takes both the conversation and the summary as input to simulate the human annotation process.

Explicit Syntactic Guidance for Neural Text Generation

1 code implementation • 20 Jun 2023 • Yafu Li, Leyang Cui, Jianhao Yan, Yongjing Yin, Wei Bi, Shuming Shi, Yue Zhang

Most existing text generation models follow the sequence-to-sequence paradigm.

Machine Translation Paraphrase Generation +1

Non-Autoregressive Document-Level Machine Translation

1 code implementation • 22 May 2023 • Guangsheng Bao, Zhiyang Teng, Hao Zhou, Jianhao Yan, Yue Zhang

However, current NAT models still have a significant performance gap compared to their AT counterparts.

Document Level Machine Translation Machine Translation +3

DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding

no code implementations • 8 Dec 2022 • Jianhao Yan, Jin Xu, Fandong Meng, Jie Zhou, Yue Zhang

In this work, we show that the issue arises from the inconsistency of label smoothing between the token-level and sequence-level distributions.

Machine Translation NMT

Probing Causes of Hallucinations in Neural Machine Translations

no code implementations • 25 Jun 2022 • Jianhao Yan, Fandong Meng, Jie Zhou

Hallucination, a kind of pathological translation that plagues Neural Machine Translation, has recently drawn much attention.

Hallucination Machine Translation +2

Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation

2 code implementations • 6 Jun 2022 • Jin Xu, Xiaojiang Liu, Jianhao Yan, Deng Cai, Huayang Li, Jian Li

While large-scale neural language models, such as GPT2 and BART, have achieved impressive results on various text generation tasks, they tend to get stuck in undesirable sentence-level loops with maximization-based decoding algorithms (e.g., greedy search).

Sentence Text Generation +1

Digging Errors in NMT: Evaluating and Understanding Model Errors from Partial Hypothesis Space

no code implementations • 29 Jun 2021 • Jianhao Yan, Chenming Wu, Fandong Meng, Jie Zhou

Current evaluation of an NMT system is usually built upon a heuristic decoding algorithm (e.g., beam search) and an evaluation metric assessing similarity between the translation and the golden reference.

Data Augmentation Inductive Bias +3

Selective Knowledge Distillation for Neural Machine Translation

1 code implementation • ACL 2021 • Fusheng Wang, Jianhao Yan, Fandong Meng, Jie Zhou

As an active research field in NMT, knowledge distillation is widely applied to enhance the model's performance by transferring the teacher model's knowledge on each training sample.

Knowledge Distillation Machine Translation +2

Multi-Unit Transformers for Neural Machine Translation

1 code implementation • EMNLP 2020 • Jianhao Yan, Fandong Meng, Jie Zhou

Transformer models achieve remarkable success in Neural Machine Translation.

Machine Translation Translation

A Sentiment-Controllable Topic-to-Essay Generator with Topic Knowledge Graph

no code implementations • Findings of the Association for Computational Linguistics 2020 • Lin Qiao, Jianhao Yan, Fandong Meng, Zhendong Yang, Jie Zhou

Therefore, we propose a novel Sentiment-Controllable topic-to-essay generator with a Topic Knowledge Graph enhanced decoder, named SCTKG, which is based on the conditional variational autoencoder (CVAE) framework.

Sentence Text Generation

WeChat Neural Machine Translation Systems for WMT20

no code implementations • WMT (EMNLP) 2020 • Fandong Meng, Jianhao Yan, Yijin Liu, Yuan Gao, Xianfeng Zeng, Qinsong Zeng, Peng Li, Ming Chen, Jie Zhou, Sifan Liu, Hao Zhou

We participate in the WMT 2020 shared news translation task on Chinese to English.

Knowledge Distillation Machine Translation +3

Dual Past and Future for Neural Machine Translation

no code implementations • 15 Jul 2020 • Jianhao Yan, Fandong Meng, Jie Zhou

Though remarkable successes have been achieved by Neural Machine Translation (NMT) in recent years, it still suffers from the inadequate-translation problem.

Machine Translation NMT +2

Learning to Encode Evolutionary Knowledge for Automatic Commenting Long Novels

no code implementations • 21 Apr 2020 • Canxiang Yan, Jianhao Yan, Yangyin Xu, Cheng Niu, Jie Zhou

Static knowledge graphs have been incorporated extensively into the sequence-to-sequence framework for text generation.

Comment Generation Graph-to-Sequence +1

Relation Extraction with Temporal Reasoning Based on Memory Augmented Distant Supervision

1 code implementation • NAACL 2019 • Jianhao Yan, Lin He, Ruqin Huang, Jian Li, Ying Liu

This paper formulates the problem of relation extraction with temporal reasoning and proposes a solution to predict whether two given entities participate in a relation at a given time spot.

Relation Relation Extraction +1
