Search Results for author: Pinzhen Chen

Found 20 papers, 14 papers with code

The University of Edinburgh’s Bengali-Hindi Submissions to the WMT21 News Translation Task

1 code implementation WMT (EMNLP) 2021 Proyag Pal, Alham Fikri Aji, Pinzhen Chen, Sukanta Sen

We describe the University of Edinburgh’s Bengali\leftrightarrowHindi constrained systems submitted to the WMT21 News Translation task.

Translation

Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice?

no code implementations22 Apr 2024 Dawei Zhu, Pinzhen Chen, Miaoran Zhang, Barry Haddow, Xiaoyu Shen, Dietrich Klakow

Traditionally, success in multilingual machine translation can be attributed to three key factors in training data: large volume, diverse translation directions, and high quality.

Lucky 52: How Many Languages Are Needed to Instruction Fine-Tune Large Language Models?

no code implementations7 Apr 2024 Shaoxiong Ji, Pinzhen Chen

Fine-tuning large language models for multilingual downstream tasks requires a diverse set of languages to capture the nuances and structures of different linguistic contexts effectively.

UniArk: Improving Generalisation and Consistency for Factual Knowledge Extraction through Debiasing

1 code implementation1 Apr 2024 Yijun Yang, Jie He, Pinzhen Chen, Víctor Gutiérrez-Basulto, Jeff Z. Pan

We hypothesize that simultaneously debiasing these objectives can be the key to generalisation over unseen prompts.

Fine-tuning Large Language Models with Sequential Instructions

1 code implementation12 Mar 2024 Hanxu Hu, Pinzhen Chen, Edoardo M. Ponti

Targeting the scarcity of sequential instructions in present-day data, we propose sequential instruction tuning, a simple yet effective strategy to automatically augment instruction tuning data and equip LLMs with the ability to execute multiple sequential instructions.

Large Language Model Inference with Lexical Shortlisting

no code implementations16 Nov 2023 Nikolay Bogoychev, Pinzhen Chen, Barry Haddow, Alexandra Birch

Large language model (LLM) inference is computation and memory intensive, so we adapt lexical shortlisting to it hoping to improve both.

Language Modelling Large Language Model +1

Terminology-Aware Translation with Constrained Decoding and Large Language Model Prompting

no code implementations9 Oct 2023 Nikolay Bogoychev, Pinzhen Chen

Terminology correctness is important in the downstream application of machine translation, and a prevalent way to ensure this is to inject terminology constraints into a translation system.

Language Modelling Large Language Model +3

Towards Effective Disambiguation for Machine Translation with Large Language Models

no code implementations20 Sep 2023 Vivek Iyer, Pinzhen Chen, Alexandra Birch

Resolving semantic ambiguity has long been recognised as a central challenge in the field of Machine Translation.

Benchmarking In-Context Learning +3

Monolingual or Multilingual Instruction Tuning: Which Makes a Better Alpaca

1 code implementation16 Sep 2023 Pinzhen Chen, Shaoxiong Ji, Nikolay Bogoychev, Andrey Kutuzov, Barry Haddow, Kenneth Heafield

Foundational large language models (LLMs) can be instruction-tuned to perform open-domain question answering, facilitating applications like chat assistants.

Instruction Following Large Language Model +3

Iterative Translation Refinement with Large Language Models

no code implementations6 Jun 2023 Pinzhen Chen, Zhicheng Guo, Barry Haddow, Kenneth Heafield

In this paper, we propose iterative translation refinement to leverage the power of large language models for more natural translation and post-editing.

Language Modelling Large Language Model +1

Exploring Data Augmentation for Code Generation Tasks

1 code implementation5 Feb 2023 Pinzhen Chen, Gerasimos Lampouras

Advances in natural language processing, such as transfer learning from pre-trained language models, have impacted how models are trained for programming language tasks too.

Code Summarization Code Translation +2

The University of Edinburgh's Submission to the WMT22 Code-Mixing Shared Task (MixMT)

1 code implementation20 Oct 2022 Faheem Kirefu, Vivek Iyer, Pinzhen Chen, Laurie Burchell

For subtask 1 we explored the effects of constrained decoding on English and transliterated subwords in order to produce Hinglish.

Machine Translation Text Generation +1

To Adapt or to Fine-tune: A Case Study on Abstractive Summarization

1 code implementation CCL 2022 Zheng Zhao, Pinzhen Chen

Recent advances in the field of abstractive summarization leverage pre-trained language models rather than train a model from scratch.

Abstractive Text Summarization Language Modelling +1

A Unified Model for Reverse Dictionary and Definition Modelling

1 code implementation9 May 2022 Pinzhen Chen, Zheng Zhao

We build a dual-way neural dictionary to retrieve words given definitions, and produce definitions for queried words.

Definition Modelling Reverse Dictionary

Cannot find the paper you are looking for? You can Submit a new open access paper.