Search Results for author: Jianhao Yan

Found 18 papers, 10 papers with code

Potential and Challenges of Model Editing for Social Debiasing

no code implementations21 Feb 2024 Jianhao Yan, Futing Wang, Yafu Li, Yue Zhang

Large language models (LLMs) trained on vast corpora suffer from inevitable stereotype biases.

Model Editing

Understanding In-Context Learning from Repetitions

1 code implementation30 Sep 2023 Jianhao Yan, Jin Xu, Chiyu Song, Chenming Wu, Yafu Li, Yue Zhang

This paper explores the elusive mechanism underpinning in-context learning in Large Language Models (LLMs).

In-Context Learning Text Generation

Revisiting Cross-Lingual Summarization: A Corpus-based Study and A New Benchmark with Improved Annotation

1 code implementation8 Jul 2023 Yulong Chen, Huajian Zhang, Yijie Zhou, Xuefeng Bai, Yueguan Wang, Ming Zhong, Jianhao Yan, Yafu Li, Judy Li, Michael Zhu, Yue Zhang

Additionally, based on the same intuition, we propose a 2-Step method, which takes both conversation and summary as input to simulate human annotation process.

DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding

no code implementations8 Dec 2022 Jianhao Yan, Jin Xu, Fandong Meng, Jie zhou, Yue Zhang

In this work, we show that the issue arises from the un-consistency of label smoothing on the token-level and sequence-level distributions.

Machine Translation NMT

Probing Causes of Hallucinations in Neural Machine Translations

no code implementations25 Jun 2022 Jianhao Yan, Fandong Meng, Jie zhou

Hallucination, one kind of pathological translations that bothers Neural Machine Translation, has recently drawn much attention.

Hallucination Machine Translation +2

Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation

2 code implementations6 Jun 2022 Jin Xu, Xiaojiang Liu, Jianhao Yan, Deng Cai, Huayang Li, Jian Li

While large-scale neural language models, such as GPT2 and BART, have achieved impressive results on various text generation tasks, they tend to get stuck in undesirable sentence-level loops with maximization-based decoding algorithms (\textit{e. g.}, greedy search).

Sentence Text Generation +1

Digging Errors in NMT: Evaluating and Understanding Model Errors from Partial Hypothesis Space

no code implementations29 Jun 2021 Jianhao Yan, Chenming Wu, Fandong Meng, Jie zhou

Current evaluation of an NMT system is usually built upon a heuristic decoding algorithm (e. g., beam search) and an evaluation metric assessing similarity between the translation and golden reference.

Data Augmentation Inductive Bias +3

Selective Knowledge Distillation for Neural Machine Translation

1 code implementation ACL 2021 Fusheng Wang, Jianhao Yan, Fandong Meng, Jie zhou

As an active research field in NMT, knowledge distillation is widely applied to enhance the model's performance by transferring teacher model's knowledge on each training sample.

Knowledge Distillation Machine Translation +2

A Sentiment-Controllable Topic-to-Essay Generator with Topic Knowledge Graph

no code implementations Findings of the Association for Computational Linguistics 2020 Lin Qiao, Jianhao Yan, Fandong Meng, Zhendong Yang, Jie zhou

Therefore, we propose a novel Sentiment-Controllable topic-to-essay generator with a Topic Knowledge Graph enhanced decoder, named SCTKG, which is based on the conditional variational autoencoder (CVAE) framework.

Sentence Text Generation

Dual Past and Future for Neural Machine Translation

no code implementations15 Jul 2020 Jianhao Yan, Fandong Meng, Jie zhou

Though remarkable successes have been achieved by Neural Machine Translation (NMT) in recent years, it still suffers from the inadequate-translation problem.

Machine Translation NMT +2

Learning to Encode Evolutionary Knowledge for Automatic Commenting Long Novels

no code implementations21 Apr 2020 Canxiang Yan, Jianhao Yan, Yangyin Xu, Cheng Niu, Jie zhou

Static knowledge graph has been incorporated extensively into sequence-to-sequence framework for text generation.

Comment Generation Graph-to-Sequence +1

Relation Extraction with Temporal Reasoning Based on Memory Augmented Distant Supervision

1 code implementation NAACL 2019 Jianhao Yan, Lin He, Ruqin Huang, Jian Li, Ying Liu

This paper formulates the problem of relation extraction with temporal reasoning and proposes a solution to predict whether two given entities participate in a relation at a given time spot.

Relation Relation Extraction +1

Cannot find the paper you are looking for? You can Submit a new open access paper.