Search Results for author: Chunliang Zhang

Found 11 papers, 5 papers with code

Prior Constraints-based Reward Model Training for Aligning Large Language Models

1 code implementation • 1 Apr 2024 • Hang Zhou, Chenglong Wang, Yimin Hu, Tong Xiao, Chunliang Zhang, Jingbo Zhu

Reinforcement learning with human feedback for aligning large language models (LLMs) trains a reward model typically using ranking loss with comparison pairs. However, the training procedure suffers from an inherent problem: the uncontrolled scaling of reward scores during reinforcement learning due to the lack of constraints while training the reward model. This paper proposes a Prior Constraints-based Reward Model (namely PCRM) training method to mitigate this problem.

reinforcement-learning

9

Paper
Code

Large Language Models are Parallel Multilingual Learners

1 code implementation • 14 Mar 2024 • Yongyu Mu, Peinan Feng, Zhiquan Cao, Yuzhang Wu, Bei Li, Chenglong Wang, Tong Xiao, Kai Song, Tongran Liu, Chunliang Zhang, Jingbo Zhu

In this study, we reveal an in-context learning (ICL) capability of multilingual large language models (LLMs): by translating the input to several languages, we provide Parallel Input in Multiple Languages (PiM) to LLMs, which significantly enhances their comprehension abilities.

In-Context Learning

6

Paper
Code

Soft Alignment of Modality Space for End-to-end Speech Translation

no code implementations • 18 Dec 2023 • Yuhao Zhang, Kaiqi Kou, Bei Li, Chen Xu, Chunliang Zhang, Tong Xiao, Jingbo Zhu

End-to-end Speech Translation (ST) aims to convert speech into target text within a unified model.

Cross-Lingual Transfer Translation

Paper
Add Code

Rethinking and Improving Multi-task Learning for End-to-end Speech Translation

1 code implementation • 7 Nov 2023 • Yuhao Zhang, Chen Xu, Bei Li, Hao Chen, Tong Xiao, Chunliang Zhang, Jingbo Zhu

Significant improvements in end-to-end speech translation (ST) have been achieved through the application of multi-task learning.

Multi-Task Learning

0

Paper
Code

Learning Evaluation Models from Large Language Models for Sequence Generation

no code implementations • 8 Aug 2023 • Chenglong Wang, Hang Zhou, Kaiyan Chang, Tongran Liu, Chunliang Zhang, Quan Du, Tong Xiao, Jingbo Zhu

Large language models achieve state-of-the-art performance on sequence generation evaluation, but typically have a large number of parameters.

Machine Translation Style Transfer +1

Paper
Add Code

Augmenting Large Language Model Translators via Translation Memories

no code implementations • 27 May 2023 • Yongyu Mu, Abudurexiti Reheman, Zhiquan Cao, Yuchun Fan, Bei Li, Yinqiao Li, Tong Xiao, Chunliang Zhang, Jingbo Zhu

Using translation memories (TMs) as prompts is a promising approach to in-context learning of machine translation models.

In-Context Learning Language Modelling +4

Paper
Add Code

Improving End-to-end Speech Translation by Leveraging Auxiliary Speech and Text Data

no code implementations • 4 Dec 2022 • Yuhao Zhang, Chen Xu, Bojie Hu, Chunliang Zhang, Tong Xiao, Jingbo Zhu

We present a method for introducing a text encoder into pre-trained end-to-end speech translation systems.

Denoising Translation

Paper
Add Code

Learning Light-Weight Translation Models from Deep Transformer

1 code implementation • 27 Dec 2020 • Bei Li, Ziyang Wang, Hui Liu, Quan Du, Tong Xiao, Chunliang Zhang, Jingbo Zhu

We proposed a novel group-permutation based knowledge distillation approach to compressing the deep Transformer model into a shallow model.

Knowledge Distillation Machine Translation +2

14

Paper
Code

Improved Differentiable Architecture Search for Language Modeling and Named Entity Recognition

1 code implementation • IJCNLP 2019 • Yufan Jiang, Chi Hu, Tong Xiao, Chunliang Zhang, Jingbo Zhu

In this paper, we study differentiable neural architecture search (NAS) methods for natural language processing.

Ranked #1 on Language Modelling on PTB Diagnostic ECG Database

Language Modelling named-entity-recognition +3

6

Paper
Code

A Hybrid Approach to Skeleton-based Translation

no code implementations • ACL 2014 • Tong Xiao, Jingbo Zhu, Chunliang Zhang

Language Modelling Machine Translation +1

Paper
Add Code

Learning Better Rule Extraction with Translation Span Alignment

no code implementations • ACL 2012 • Jingbo Zhu, Tong Xiao, Chunliang Zhang

Machine Translation Translation +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.