Search Results for author: Luyu Gao

Found 20 papers, 14 papers with code

PAL: Program-aided Language Models

2 code implementations 18 Nov 2022 Luyu Gao, Aman Madaan, Shuyan Zhou, Uri Alon, PengFei Liu, Yiming Yang, Jamie Callan, Graham Neubig

Much of this success can be attributed to prompting methods such as "chain-of-thought", which employ LLMs both to understand the problem description by decomposing it into steps and to solve each step of the problem.

Arithmetic Reasoning • GSM8K • +2
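The program-aided idea sketched in the abstract can be illustrated with a toy example: the LLM emits a Python "reasoning program", and the interpreter, not the model, computes the final answer. Here `llm_generate` is a hypothetical stand-in for an actual LLM call, not the paper's API.

```python
def llm_generate(problem: str) -> str:
    # Stand-in for an LLM call: a real system would prompt the model with
    # few-shot examples whose rationales are executable code.
    return (
        "roger_balls = 5\n"
        "cans_bought = 2\n"
        "balls_per_can = 3\n"
        "answer = roger_balls + cans_bought * balls_per_can\n"
    )

def solve_with_pal(problem: str):
    program = llm_generate(problem)
    scope = {}
    exec(program, {}, scope)  # offload the arithmetic to the interpreter
    return scope["answer"]

print(solve_with_pal("Roger has 5 balls and buys 2 cans of 3 balls each. Total?"))
# prints 11
```

The division of labor is the point: the model only needs to decompose the problem into named steps; execution guarantees the arithmetic is exact.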

Active Retrieval Augmented Generation

1 code implementation 11 May 2023 Zhengbao Jiang, Frank F. Xu, Luyu Gao, Zhiqing Sun, Qian Liu, Jane Dwivedi-Yu, Yiming Yang, Jamie Callan, Graham Neubig

In this work, we provide a generalized view of active retrieval augmented generation: methods that actively decide when and what to retrieve over the course of generation.

Retrieval • Sentence
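The "when and what to retrieve" decision can be sketched as a toy loop: generate a tentative sentence, and if its confidence is low (the "when"), retrieve evidence using that tentative sentence as the query (the "what"), then regenerate. All callables and the 0.6 threshold here are hypothetical stand-ins, not the paper's implementation.

```python
def active_generate(drafts, retrieve, regenerate, threshold=0.6):
    # drafts: list of (tentative_sentence, confidence) pairs from a generator
    output = []
    for sentence, confidence in drafts:
        if confidence < threshold:            # "when": confidence triggers retrieval
            evidence = retrieve(sentence)     # "what": query from the tentative sentence
            sentence = regenerate(sentence, evidence)
        output.append(sentence)
    return " ".join(output)

# Stand-in components for demonstration:
retrieve = lambda query: "CMU is in Pittsburgh."
regenerate = lambda sent, evidence: evidence  # replace shaky sentence with retrieved fact

drafts = [("CMU is a university.", 0.9), ("CMU is in Boston.", 0.3)]
print(active_generate(drafts, retrieve, regenerate))
# prints "CMU is a university. CMU is in Pittsburgh."
```

Confident sentences pass through untouched, so retrieval cost is paid only where the generator is uncertain.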

Tevatron: An Efficient and Flexible Toolkit for Dense Retrieval

1 code implementation 11 Mar 2022 Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan

In this paper, we present Tevatron, a dense retrieval toolkit optimized for efficiency, flexibility, and code simplicity.

Retrieval

Precise Zero-Shot Dense Retrieval without Relevance Labels

1 code implementation 20 Dec 2022 Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan

Given a query, HyDE first zero-shot instructs an instruction-following language model (e.g., InstructGPT) to generate a hypothetical document.

Fact Verification • Instruction Following • +3
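The HyDE flow can be sketched end to end with toy components: a bag-of-words counter stands in for a real dense encoder, and `generate_hypothetical` is a hypothetical stand-in for the instruction-following LLM call. The key move is that the corpus is searched with the embedding of the generated document, not of the query.

```python
from collections import Counter
import math

def embed(text):
    return Counter(text.lower().split())

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def hyde_search(query, corpus, generate_hypothetical):
    hypothetical = generate_hypothetical(query)  # step 1: fluent (possibly wrong) document
    q_vec = embed(hypothetical)                  # step 2: encode the document, not the query
    return max(corpus, key=lambda doc: cosine(q_vec, embed(doc)))  # step 3: nearest real doc

corpus = [
    "the eiffel tower is in paris france",
    "python is a programming language",
]
gen = lambda q: "the eiffel tower is a landmark located in paris"
print(hyde_search("where is the eiffel tower", corpus, gen))
# prints "the eiffel tower is in paris france"
```

Because document-to-document similarity is easier than query-to-document similarity, this works without any relevance labels, which is the zero-shot angle in the title.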

Rethink Training of BERT Rerankers in Multi-Stage Retrieval Pipeline

1 code implementation 21 Jan 2021 Luyu Gao, Zhuyun Dai, Jamie Callan

Pre-trained deep language models (LM) have advanced the state-of-the-art of text retrieval.

Retrieval • Text Retrieval

COIL: Revisit Exact Lexical Match in Information Retrieval with Contextualized Inverted List

1 code implementation NAACL 2021 Luyu Gao, Zhuyun Dai, Jamie Callan

Classical information retrieval systems such as BM25 rely on exact lexical match and carry out search efficiently with an inverted list index.

Information Retrieval • Retrieval
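The COIL-style scoring idea can be shown with a toy: exact lexical match gates which query-document token pairs interact, but each match is scored with contextualized vectors rather than term frequencies. The tiny hand-made vectors below stand in for learned token embeddings and are not values from the paper.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def coil_score(query_vecs, doc_inverted):
    # query_vecs: {token: vector}; doc_inverted: {token: [vectors]} for one
    # document, i.e. its entries in a contextualized inverted list.
    score = 0.0
    for token, q_vec in query_vecs.items():
        if token in doc_inverted:  # exact-match gate, as in a classical inverted index
            score += max(dot(q_vec, d_vec) for d_vec in doc_inverted[token])
    return score

query = {"bank": [1.0, 0.0]}              # "bank" in a financial context
doc = {"bank": [[0.9, 0.1], [0.2, 0.8]]}  # two occurrences, two different contexts
print(coil_score(query, doc))
# prints 0.9 (the financial-sense occurrence scores highest)
```

The inverted-index gate keeps lookup as cheap as BM25 while the per-occurrence vectors disambiguate word senses that term frequencies cannot.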

Retrieval as Attention: End-to-end Learning of Retrieval and Reading within a Single Transformer

1 code implementation 5 Dec 2022 Zhengbao Jiang, Luyu Gao, Jun Araki, Haibo Ding, Zhiruo Wang, Jamie Callan, Graham Neubig

Systems for knowledge-intensive tasks such as open-domain question answering (QA) usually consist of two stages: efficient retrieval of relevant documents from a large corpus and detailed reading of the selected documents to generate answers.

Open-Domain Question Answering • Passage Retrieval • +1
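The two-stage pipeline the abstract describes can be sketched as follows, with both stages as toy stand-ins (word-overlap retrieval and a trivial reader), not the paper's single-Transformer model:

```python
def retrieve(question, corpus, k=2):
    # Stage 1: cheap retrieval over the whole corpus (toy: word overlap).
    q_words = set(question.lower().split())
    scored = sorted(corpus,
                    key=lambda d: len(q_words & set(d.lower().split())),
                    reverse=True)
    return scored[:k]

def read(question, docs):
    # Stage 2: detailed reading of only the retrieved documents
    # (toy: return the top-ranked document as the supporting answer).
    return docs[0]

corpus = [
    "bananas are yellow fruit",
    "the capital of france is paris",
    "paris is home to the louvre",
]
question = "what is the capital of france"
print(read(question, retrieve(question, corpus)))
# prints "the capital of france is paris"
```

The paper's contribution is collapsing these two separately-trained stages into one end-to-end Transformer; the sketch only shows the conventional pipeline it replaces.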

DataFinder: Scientific Dataset Recommendation from Natural Language Descriptions

1 code implementation 26 May 2023 Vijay Viswanathan, Luyu Gao, Tongshuang Wu, PengFei Liu, Graham Neubig

Using this data, we compare various information retrieval algorithms on our test set and present a superior bi-encoder retriever for text-based dataset recommendation.

Information Retrieval • Retrieval

Long Document Re-ranking with Modular Re-ranker

1 code implementation 9 May 2022 Luyu Gao, Jamie Callan

In this paper, we propose instead to model full query-to-document interaction, leveraging the attention operation and a modular Transformer re-ranker framework.

Document Ranking • Re-Ranking

Modularized Transformer-based Ranking Framework

no code implementations EMNLP 2020 Luyu Gao, Zhuyun Dai, Jamie Callan

Recent innovations in Transformer-based ranking models have advanced the state-of-the-art in information retrieval.

Information Retrieval • Retrieval

Complementing Lexical Retrieval with Semantic Residual Embedding

no code implementations 29 Apr 2020 Luyu Gao, Zhuyun Dai, Tongfei Chen, Zhen Fan, Benjamin Van Durme, Jamie Callan

This paper presents CLEAR, a retrieval model that seeks to complement classical lexical exact-match models such as BM25 with semantic matching signals from a neural embedding matching model.

Information Retrieval • Retrieval
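The lexical-plus-semantic complementing that CLEAR-style hybrid retrieval performs can be sketched as score interpolation: rank by a weighted sum of an exact-match score (standing in for BM25) and a dense-embedding score. The weight `lam` and all scores below are hypothetical illustrations, not values from the paper.

```python
def hybrid_rank(docs, lexical_scores, semantic_scores, lam=0.5):
    # docs: list of ids; each score dict maps doc_id -> float from one retriever.
    fused = {d: lexical_scores.get(d, 0.0) + lam * semantic_scores.get(d, 0.0)
             for d in docs}
    return sorted(docs, key=lambda d: fused[d], reverse=True)

docs = ["d1", "d2", "d3"]
lexical = {"d1": 2.0, "d2": 0.0, "d3": 1.0}   # d2 shares no exact terms with the query
semantic = {"d1": 0.1, "d2": 5.0, "d3": 0.5}  # but is semantically close
print(hybrid_rank(docs, lexical, semantic))
# prints ['d2', 'd1', 'd3']
```

The "residual" framing in the paper goes further, training the embedding model specifically on what the lexical model gets wrong, but the fused ranking above is the shape of the final scoring.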

Understanding BERT Rankers Under Distillation

no code implementations 21 Jul 2020 Luyu Gao, Zhuyun Dai, Jamie Callan

Deep language models such as BERT, pre-trained on large corpora, have given a huge performance boost to state-of-the-art information retrieval ranking systems.

Information Retrieval • Retrieval

Improving Target-side Lexical Transfer in Multilingual Neural Machine Translation

no code implementations Findings of the Association for Computational Linguistics 2020 Luyu Gao, Xinyi Wang, Graham Neubig

To improve the performance of Neural Machine Translation (NMT) for low-resource languages (LRL), one effective strategy is to leverage parallel data from a related high-resource language (HRL).

Machine Translation • NMT • +1

A Self-enhancement Approach for Domain-specific Chatbot Training via Knowledge Mining and Digest

no code implementations 17 Nov 2023 Ruohong Zhang, Luyu Gao, Chen Zheng, Zhen Fan, Guokun Lai, Zheng Zhang, Fangzhou Ai, Yiming Yang, Hongxia Yang

This paper introduces a novel approach to enhance LLMs by effectively extracting relevant knowledge from domain-specific textual sources and adaptively training a chatbot on domain-specific inquiries.

Chatbot • Text Generation

In-Context Principle Learning from Mistakes

no code implementations 8 Feb 2024 Tianjun Zhang, Aman Madaan, Luyu Gao, Steven Zheng, Swaroop Mishra, Yiming Yang, Niket Tandon, Uri Alon

We evaluate LEAP on a wide range of benchmarks, including multi-hop question answering (HotpotQA), textual QA (DROP), Big-Bench Hard reasoning, and math problems (GSM8K and MATH); on all these benchmarks, LEAP improves the strongest available LLMs, such as GPT-3.5-turbo, GPT-4, GPT-4-turbo, and Claude-2.1.

GSM8K • In-Context Learning • +3
