Search Results for author: Jiahuan Li

Found 8 papers, 1 papers with code

Data Augmentation for Low-resource Word Segmentation and POS Tagging of Ancient Chinese Texts

no code implementations LT4HALA (LREC) 2022 Yutong Shen, Jiahuan Li, ShuJian Huang, Yi Zhou, Xiaopeng Xie, Qinxin Zhao

Although SikuRoberta significantly boosts performance on WSG and POS tasks on ancient Chinese texts, the lack of labeled data still limits the performance of the model.

Data Augmentation Language Modelling +3

Why Not Transform Chat Large Language Models to Non-English?

1 code implementation22 May 2024 Xiang Geng, Ming Zhu, Jiahuan Li, Zhejian Lai, Wei Zou, Shuaijie She, Jiaxin Guo, Xiaofeng Zhao, Yinglu Li, Yuang Li, Chang Su, Yanqing Zhao, Min Zhang, Hao Yang, Xinglin Lyu, Jiajun Chen, ShuJian Huang

For the second issue, we propose a method comprising two synergistic components: low-rank adaptation for training to maintain the original LLM parameters, and recovery KD, which utilizes data generated by the chat LLM itself to recover the original knowledge from the frozen parameters.

MT-PATCHER: Selective and Extendable Knowledge Distillation from Large Language Models for Machine Translation

no code implementations14 Mar 2024 Jiahuan Li, Shanbo Cheng, ShuJian Huang, Jiajun Chen

Large Language Models (LLM) have demonstrated their strong ability in the field of machine translation (MT), yet they suffer from high computational cost and latency.

Knowledge Distillation Machine Translation +1

Eliciting the Translation Ability of Large Language Models via Multilingual Finetuning with Translation Instructions

no code implementations24 May 2023 Jiahuan Li, Hao Zhou, ShuJian Huang, Shanbo Cheng, Jiajun Chen

Secondly, we find that LLMs' ability to carry out translation instructions relies on the understanding of translation instructions and the alignment among different languages.

Language Modelling Translation

DirectQE: Direct Pretraining for Machine Translation Quality Estimation

no code implementations15 May 2021 Qu Cui, ShuJian Huang, Jiahuan Li, Xiang Geng, Zaixiang Zheng, Guoping Huang, Jiajun Chen

However, we argue that there are gaps between the predictor and the estimator in both data quality and training objectives, which preclude QE models from benefiting from a large number of parallel corpora more directly.

Machine Translation Translation

Explicit Semantic Decomposition for Definition Generation

no code implementations ACL 2020 Jiahuan Li, Yu Bao, Shu-Jian Huang, Xin-yu Dai, Jia-Jun Chen

Definition generation, which aims to automatically generate dictionary definitions for words, has recently been proposed to assist the construction of dictionaries and help people understand unfamiliar texts.

Cannot find the paper you are looking for? You can Submit a new open access paper.