Search Results for author: Yixuan Su

Found 31 papers, 21 papers with code

500xCompressor: Generalized Prompt Compression for Large Language Models

1 code implementation6 Aug 2024 Zongqian Li, Yixuan Su, Nigel Collier

Prompt compression is crucial for enhancing inference speed, reducing costs, and improving user experience.

Language Modelling Large Language Model +1

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

no code implementations29 Apr 2024 Pat Verga, Sebastian Hofstatter, Sophia Althammer, Yixuan Su, Aleksandra Piktus, Arkady Arkhangorodsky, Minjie Xu, Naomi White, Patrick Lewis

As Large Language Models (LLMs) have become more advanced, they have outpaced our abilities to accurately evaluate their quality.

Unlocking Structure Measuring: Introducing PDD, an Automatic Metric for Positional Discourse Coherence

1 code implementation15 Feb 2024 Yinhong Liu, Yixuan Su, Ehsan Shareghi, Nigel Collier

Recent large language models (LLMs) have shown remarkable performance in aligning generated text with user intentions across various tasks.

Coherence Evaluation Text Generation

Instruct-SCTG: Guiding Sequential Controlled Text Generation through Instructions

no code implementations19 Dec 2023 Yinhong Liu, Yixuan Su, Ehsan Shareghi, Nigel Collier

Instruction-tuned large language models have shown remarkable performance in aligning generated text with user intentions across various tasks.

Text Generation

Specialist or Generalist? Instruction Tuning for Specific NLP Tasks

no code implementations23 Oct 2023 Chufan Shi, Yixuan Su, Cheng Yang, Yujiu Yang, Deng Cai

Although instruction tuning has proven to be a data-efficient method for transforming LLMs into such generalist models, their performance still lags behind specialist models trained exclusively for specific tasks.

Specificity

Sparkles: Unlocking Chats Across Multiple Images for Multimodal Instruction-Following Models

1 code implementation31 Aug 2023 Yupan Huang, Zaiqiao Meng, Fangyu Liu, Yixuan Su, Nigel Collier, Yutong Lu

Furthermore, we construct SparklesEval, a GPT-assisted benchmark for quantitatively assessing a model's conversational competence across multiple images and dialogue turns.

Instruction Following Visual Reasoning

PandaGPT: One Model To Instruction-Follow Them All

1 code implementation25 May 2023 Yixuan Su, Tian Lan, Huayang Li, Jialu Xu, Yan Wang, Deng Cai

To do so, PandaGPT combines the multimodal encoders from ImageBind and the large language models from Vicuna.

Instruction Following

Biomedical Named Entity Recognition via Dictionary-based Synonym Generalization

1 code implementation22 May 2023 Zihao Fu, Yixuan Su, Zaiqiao Meng, Nigel Collier

To alleviate the need of human effort, dictionary-based approaches have been proposed to extract named entities simply based on a given dictionary.

named-entity-recognition Named Entity Recognition

COFFEE: A Contrastive Oracle-Free Framework for Event Extraction

1 code implementation25 Mar 2023 Meiru Zhang, Yixuan Su, Zaiqiao Meng, Zihao Fu, Nigel Collier

In this study, we consider a more realistic setting of this task, namely the Oracle-Free Event Extraction (OFEE) task, where only the input context is given without any oracle information, including event type, event ontology and trigger word.

Event Extraction

Plug-and-Play Recipe Generation with Content Planning

no code implementations9 Dec 2022 Yinhong Liu, Yixuan Su, Ehsan Shareghi, Nigel Collier

Specifically, it optimizes the joint distribution of the natural language sequence and the global content plan in a plug-and-play manner.

Recipe Generation Sentence +1

Momentum Decoding: Open-ended Text Generation As Graph Exploration

1 code implementation5 Dec 2022 Tian Lan, Yixuan Su, Shuhang Liu, Heyan Huang, Xian-Ling Mao

In this study, we formulate open-ended text generation from a new perspective, i. e., we view it as an exploration process within a directed graph.

Text Generation

An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation

3 code implementations19 Nov 2022 Yixuan Su, Jialu Xu

In the study, we empirically compare the two recently proposed decoding methods, i. e. Contrastive Search (CS) and Contrastive Decoding (CD), for open-ended text generation.

Diversity Text Generation

Contrastive Search Is What You Need For Neural Text Generation

3 code implementations25 Oct 2022 Yixuan Su, Nigel Collier

Based on our findings, we further assess the contrastive search decoding method using off-the-shelf LMs on four generation tasks across 16 languages.

Contrastive Learning Language Modelling +1

From Easy to Hard: A Dual Curriculum Learning Framework for Context-Aware Document Ranking

1 code implementation22 Aug 2022 Yutao Zhu, Jian-Yun Nie, Yixuan Su, Haonan Chen, Xinyu Zhang, Zhicheng Dou

In this work, we propose a curriculum learning framework for context-aware document ranking, in which the ranking model learns matching signals between the search context and the candidate document in an easy-to-hard manner.

Document Ranking

Language Models Can See: Plugging Visual Controls in Text Generation

1 code implementation5 May 2022 Yixuan Su, Tian Lan, Yahui Liu, Fangyu Liu, Dani Yogatama, Yan Wang, Lingpeng Kong, Nigel Collier

MAGIC is a flexible framework and is theoretically compatible with any text generation tasks that incorporate image grounding.

Image Captioning Image-text matching +3

A Contrastive Framework for Neural Text Generation

2 code implementations13 Feb 2022 Yixuan Su, Tian Lan, Yan Wang, Dani Yogatama, Lingpeng Kong, Nigel Collier

Text generation is of great importance to many natural language processing applications.

Diversity Text Generation

A Survey on Retrieval-Augmented Text Generation

no code implementations2 Feb 2022 Huayang Li, Yixuan Su, Deng Cai, Yan Wang, Lemao Liu

Recently, retrieval-augmented text generation attracted increasing attention of the computational linguistics community.

Machine Translation Response Generation +3

Rewire-then-Probe: A Contrastive Recipe for Probing Biomedical Knowledge of Pre-trained Language Models

1 code implementation ACL 2022 Zaiqiao Meng, Fangyu Liu, Ehsan Shareghi, Yixuan Su, Charlotte Collins, Nigel Collier

To catalyse the research in this direction, we release a well-curated biomedical knowledge probing benchmark, MedLAMA, which is constructed based on the Unified Medical Language System (UMLS) Metathesaurus.

Knowledge Probing Transfer Learning

Exploring Dense Retrieval for Dialogue Response Selection

1 code implementation13 Oct 2021 Tian Lan, Deng Cai, Yan Wang, Yixuan Su, Heyan Huang, Xian-Ling Mao

In this study, we present a solution to directly select proper responses from a large corpus or even a nonparallel corpus that only consists of unpaired sentences, using a dense retrieval model.

Conversational Response Selection Retrieval

Plan-then-Generate: Controlled Data-to-Text Generation via Planning

2 code implementations Findings (EMNLP) 2021 Yixuan Su, David Vandyke, Sihui Wang, Yimai Fang, Nigel Collier

However, the lack of ability of neural models to control the structure of generated output can be limiting in certain real-world applications.

Data-to-Text Generation Diversity +1

Dialogue Response Selection with Hierarchical Curriculum Learning

1 code implementation ACL 2021 Yixuan Su, Deng Cai, Qingyu Zhou, Zibo Lin, Simon Baker, Yunbo Cao, Shuming Shi, Nigel Collier, Yan Wang

As for IC, it progressively strengthens the model's ability in identifying the mismatching information between the dialogue context and a response candidate.

Conversational Response Selection

Prototype-to-Style: Dialogue Generation with Style-Aware Editing on Retrieval Memory

no code implementations5 Apr 2020 Yixuan Su, Yan Wang, Simon Baker, Deng Cai, Xiaojiang Liu, Anna Korhonen, Nigel Collier

A stylistic response generator then takes the prototype and the desired language style as model input to obtain a high-quality and stylistic response.

Dialogue Generation Information Retrieval +1

Stylistic Dialogue Generation via Information-Guided Reinforcement Learning Strategy

no code implementations5 Apr 2020 Yixuan Su, Deng Cai, Yan Wang, Simon Baker, Anna Korhonen, Nigel Collier, Xiaojiang Liu

To enable better balance between the content quality and the style, we introduce a new training strategy, know as Information-Guided Reinforcement Learning (IG-RL).

Dialogue Generation reinforcement-learning +3

Cannot find the paper you are looking for? You can Submit a new open access paper.