Search Results for author: Wen-tau Yih

Found 95 papers, 42 papers with code

MSR SPLAT, a language analysis toolkit

no code implementations • NAACL 2012 • Chris Quirk, Pallavi Choudhury, Jianfeng Gao, Hisami Suzuki, Kristina Toutanova, Michael Gamon, Wen-tau Yih, Colin Cherry, V, Lucy erwende

Constituency Parsing

Paper
Add Code

Measuring Word Relatedness Using Heterogeneous Vector Space Models

no code implementations • NAACL 2012 • Wen-tau Yih, Vahed Qazvinian

Information Retrieval Natural Language Inference +1

Paper
Add Code

Polarity Inducing Latent Semantic Analysis

no code implementations • EMNLP 2012 • Wen-tau Yih, Geoffrey Zweig, John Platt

Information Retrieval Language Modelling

Paper
Add Code

Dual Coordinate Descent Algorithms for Efficient Large Margin Structured Prediction

no code implementations • TACL 2013 • Ming-Wei Chang, Wen-tau Yih

Due to the nature of complex NLP problems, structured prediction algorithms have been important modeling tools for a wide range of tasks.

Dependency Parsing Document Summarization +7

Paper
Add Code

Linguistic Regularities in Continuous Space Word Representations

no code implementations • NAACL 2013 • Tomas Mikolov, Wen-tau Yih, Geoffrey Zweig

Language Modelling

Paper
Add Code

Combining Heterogeneous Models for Measuring Relational Similarity

no code implementations • NAACL 2013 • Alisa Zhila, Wen-tau Yih, Christopher Meek, Geoffrey Zweig, Tomas Mikolov

Question Answering

Paper
Add Code

Question Answering Using Enhanced Lexical Semantic Models

no code implementations • ACL 2013 • Wen-tau Yih, Ming-Wei Chang, Christopher Meek, Andrzej Pastusiak

Answer Selection Open-Domain Question Answering +1

Paper
Add Code

Multi-Relational Latent Semantic Analysis

no code implementations • EMNLP 2013 • Kai-Wei Chang, Wen-tau Yih, Christopher Meek

Word Sense Disambiguation

Paper
Add Code

Animacy Detection with Voting Models

no code implementations • EMNLP 2013 • Joshua Moore, Christopher J. C. Burges, Erin Renshaw, Wen-tau Yih

Coreference Resolution Dependency Parsing

Paper
Add Code

Learning Semantic Representations for the Phrase Translation Model

no code implementations • 28 Nov 2013 • Jianfeng Gao, Xiaodong He, Wen-tau Yih, Li Deng

The results show that the new semantic-based phrase translation model significantly improves the performance of a state-of-the-art phrase-based statistical machine translation sys-tem, leading to a gain of 0. 7-1. 0 BLEU points.

Learning Semantic Representations Machine Translation +1

Paper
Add Code

Learning Continuous Phrase Representations for Translation Modeling

no code implementations • ACL 2014 • Jianfeng Gao, Xiaodong He, Wen-tau Yih, Li Deng

Machine Translation Translation

Paper
Add Code

Semantic Parsing for Single-Relation Question Answering

no code implementations • ACL 2014 • Wen-tau Yih, Xiaodong He, Christopher Meek

Open-Domain Question Answering Relation +2

Paper
Add Code

Typed Tensor Decomposition of Knowledge Bases for Relation Extraction

no code implementations • EMNLP 2014 • Kai-Wei Chang, Wen-tau Yih, Bishan Yang, Christopher Meek

Relation Relation Extraction +1

Paper
Add Code

Learning Multi-Relational Semantics Using Neural-Embedding Models

no code implementations • 14 Nov 2014 • Bishan Yang, Wen-tau Yih, Xiaodong He, Jianfeng Gao, Li Deng

In this paper we present a unified framework for modeling multi-relational representations, scoring, and learning, and conduct an empirical study of several recent multi-relational embedding models under the framework.

Knowledge Base Completion

Paper
Add Code

Embedding Entities and Relations for Learning and Inference in Knowledge Bases

9 code implementations • 20 Dec 2014 • Bishan Yang, Wen-tau Yih, Xiaodong He, Jianfeng Gao, Li Deng

We consider learning representations of entities and relations in KBs using the neural-embedding approach.

Ranked #10 on Link Prediction on UMLS

Link Prediction

20,062

Paper
Code

Deep Learning and Continuous Representations for Natural Language Processing

no code implementations • HLT 2015 • Wen-tau Yih, Xiaodong He, Jianfeng Gao

Information Retrieval Language Modelling +9

Paper
Add Code

Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base

1 code implementation • IJCNLP 2015 • Wen-tau Yih, Ming-Wei Chang, Xiaodong He, Jianfeng Gao

Entity Linking Graph Generation +3

112

Paper
Code

WikiQA: A Challenge Dataset for Open-Domain Question Answering

no code implementations • EMNLP 2015 • Yi Yang, Wen-tau Yih, Christopher Meek

Ranked #21 on Question Answering on WikiQA

Answer Selection Open-Domain Question Answering

Paper
Add Code

Reasoning in Vector Space: An Exploratory Study of Question Answering

no code implementations • 19 Nov 2015 • Moontae Lee, Xiaodong He, Wen-tau Yih, Jianfeng Gao, Li Deng, Paul Smolensky

Question answering tasks have shown remarkable progress with distributed vector representation.

Common Sense Reasoning Logical Reasoning +1

Paper
Add Code

Basic Reasoning with Tensor Product Representations

no code implementations • 12 Jan 2016 • Paul Smolensky, Moontae Lee, Xiaodong He, Wen-tau Yih, Jianfeng Gao, Li Deng

In this paper we present the initial development of a general theory for mapping inference in predicate logic to computation over Tensor Product Representations (TPRs; Smolensky (1990), Smolensky & Legendre (2006)).

Question Answering

Paper
Add Code

Question Answering with Knowledge Base, Web and Beyond

no code implementations • NAACL 2016 • Wen-tau Yih, Hao Ma

Question Answering Text Matching

Paper
Add Code

Story Cloze Evaluator: Vector Space Representation Evaluation by Predicting What Happens Next

no code implementations • WS 2016 • Nasrin Mostafazadeh, V, Lucy erwende, Wen-tau Yih, Pushmeet Kohli, James Allen

Representation Learning Semantic Textual Similarity

Paper
Add Code

Compositional Learning of Embeddings for Relation Paths in Knowledge Base and Text

no code implementations • ACL 2016 • Kristina Toutanova, Victoria Lin, Wen-tau Yih, Hoifung Poon, Chris Quirk

Open-Domain Question Answering Relation +1

Paper
Add Code

The Value of Semantic Parse Labeling for Knowledge Base Question Answering

no code implementations • ACL 2016 • Wen-tau Yih, Matthew Richardson, Chris Meek, Ming-Wei Chang, Jina Suh

Ranked #4 on Semantic Parsing on WebQuestionsSP

Knowledge Base Question Answering Semantic Parsing

Paper
Add Code

Learning from Explicit and Implicit Supervision Jointly For Algebra Word Problems

no code implementations • EMNLP 2016 • Shyam Upadhyay, Ming-Wei Chang, Kai-Wei Chang, Wen-tau Yih

Ranked #1 on Math Word Problem Solving on ALG514

Math Word Problem Solving

Paper
Add Code

Answering Complicated Question Intents Expressed in Decomposed Question Sequences

no code implementations • 4 Nov 2016 • Mohit Iyyer, Wen-tau Yih, Ming-Wei Chang

Recent work in semantic parsing for question answering has focused on long and complicated questions, many of which would seem unnatural if asked in a normal conversation between two humans.

Question Answering Semantic Parsing

Paper
Add Code

A Knowledge-Grounded Neural Conversation Model

2 code implementations • 7 Feb 2017 • Marjan Ghazvininejad, Chris Brockett, Ming-Wei Chang, Bill Dolan, Jianfeng Gao, Wen-tau Yih, Michel Galley

We generalize the widely-used Seq2Seq approach by conditioning responses on both conversation history and external "facts", allowing the model to be versatile and applicable in an open-domain setting.

Slot Filling

174

Paper
Code

Search-based Neural Structured Learning for Sequential Question Answering

no code implementations • ACL 2017 • Mohit Iyyer, Wen-tau Yih, Ming-Wei Chang

Recent work in semantic parsing for question answering has focused on long and complicated questions, many of which would seem unnatural if asked in a normal conversation between two humans.

Question Answering Semantic Parsing

Paper
Add Code

NLP for Precision Medicine

no code implementations • ACL 2017 • Hoifung Poon, Chris Quirk, Kristina Toutanova, Wen-tau Yih

We will introduce precision medicine and showcase the vast opportunities for NLP in this burgeoning field with great societal impact.

Decision Making Entity Linking +2

Paper
Add Code

Cross-Sentence N-ary Relation Extraction with Graph LSTMs

no code implementations • TACL 2017 • Nanyun Peng, Hoifung Poon, Chris Quirk, Kristina Toutanova, Wen-tau Yih

Past work in relation extraction has focused on binary relations in single sentences.

Multi-Task Learning Relation +2

Paper
Add Code

Maximum Margin Reward Networks for Learning from Explicit and Implicit Supervision

no code implementations • EMNLP 2017 • Haoruo Peng, Ming-Wei Chang, Wen-tau Yih

Neural networks have achieved state-of-the-art performance on several structured-output prediction tasks, trained in a fully supervised fashion.

Dependency Parsing named-entity-recognition +4

Paper
Add Code

Natural Language to Structured Query Generation via Meta-Learning

1 code implementation • NAACL 2018 • Po-Sen Huang, Chenglong Wang, Rishabh Singh, Wen-tau Yih, Xiaodong He

In conventional supervised training, a model is trained to fit all the training examples.

Ranked #7 on Code Generation on WikiSQL

Meta-Learning

129

Paper
Code

Tracking State Changes in Procedural Text: A Challenge Dataset and Models for Process Paragraph Comprehension

no code implementations • NAACL 2018 • Bhavana Dalvi Mishra, Lifu Huang, Niket Tandon, Wen-tau Yih, Peter Clark

The new dataset, ProPara, is the first to contain natural (rather than machine-generated) text about a changing world along with a full annotation of entity states (location and existence) during those changes (81k datapoints).

Ranked #4 on Procedural Text Understanding on ProPara

Procedural Text Understanding

Paper
Add Code

QuAC : Question Answering in Context

no code implementations • 21 Aug 2018 • Eunsol Choi, He He, Mohit Iyyer, Mark Yatskar, Wen-tau Yih, Yejin Choi, Percy Liang, Luke Zettlemoyer

We present QuAC, a dataset for Question Answering in Context that contains 14K information-seeking QA dialogs (100K questions in total).

Question Answering Reading Comprehension

Paper
Add Code

Dissecting Contextual Word Embeddings: Architecture and Representation

no code implementations • EMNLP 2018 • Matthew E. Peters, Mark Neumann, Luke Zettlemoyer, Wen-tau Yih

Contextual word representations derived from pre-trained bidirectional language models (biLMs) have recently been shown to provide significant improvements to the state of the art for a wide range of NLP tasks.

Word Embeddings

Paper
Add Code

Reasoning about Actions and State Changes by Injecting Commonsense Knowledge

1 code implementation • EMNLP 2018 • Niket Tandon, Bhavana Dalvi Mishra, Joel Grus, Wen-tau Yih, Antoine Bosselut, Peter Clark

Comprehending procedural text, e. g., a paragraph describing photosynthesis, requires modeling actions and the state changes they produce, so that questions about entities at different timepoints can be answered.

Reading Comprehension Structured Prediction

Paper
Code

Policy Shaping and Generalized Update Equations for Semantic Parsing from Denotations

no code implementations • EMNLP 2018 • Dipendra Misra, Ming-Wei Chang, Xiaodong He, Wen-tau Yih

Semantic parsing from denotations faces two key challenges in model training: (1) given only the denotations (e. g., answers), search for good candidate semantic parses, and (2) choose the best model update algorithm.

Question Answering Semantic Parsing

Paper
Add Code

QuAC: Question Answering in Context

no code implementations • EMNLP 2018 • Eunsol Choi, He He, Mohit Iyyer, Mark Yatskar, Wen-tau Yih, Yejin Choi, Percy Liang, Luke Zettlemoyer

We present QuAC, a dataset for Question Answering in Context that contains 14K information-seeking QA dialogs (100K questions in total).

Question Answering Reading Comprehension

Paper
Add Code

FlowQA: Grasping Flow in History for Conversational Machine Comprehension

1 code implementation • ICLR 2019 • Hsin-Yuan Huang, Eunsol Choi, Wen-tau Yih

Conversational machine comprehension requires the understanding of the conversation history, such as previous question/answer pairs, the document context, and the current question.

Ranked #1 on Question Answering on QuAC

Question Answering Reading Comprehension +1

198

Paper
Code

QuaRel: A Dataset and Models for Answering Questions about Qualitative Relationships

no code implementations • 20 Nov 2018 • Oyvind Tafjord, Peter Clark, Matt Gardner, Wen-tau Yih, Ashish Sabharwal

Many natural language questions require recognizing and reasoning with qualitative relationships (e. g., in science, economics, and medicine), but are challenging to answer with corpus-based methods.

Friction Semantic Parsing

Paper
Add Code

Be Consistent! Improving Procedural Text Comprehension using Label Consistency

1 code implementation • NAACL 2019 • Xinya Du, Bhavana Dalvi Mishra, Niket Tandon, Antoine Bosselut, Wen-tau Yih, Peter Clark, Claire Cardie

Our goal is procedural text comprehension, namely tracking how the properties of entities (e. g., their location) change with time given a procedural text (e. g., a paragraph about photosynthesis, a recipe).

Reading Comprehension

Paper
Code

Everything Happens for a Reason: Discovering the Purpose of Actions in Procedural Text

no code implementations • IJCNLP 2019 • Bhavana Dalvi Mishra, Niket Tandon, Antoine Bosselut, Wen-tau Yih, Peter Clark

Our goal is to better comprehend procedural text, e. g., a paragraph about photosynthesis, by not only predicting what happens, but why some actions need to happen before others.

Reading Comprehension

Paper
Add Code

Model-based Interactive Semantic Parsing: A Unified Framework and A Text-to-SQL Case Study

2 code implementations • IJCNLP 2019 • Ziyu Yao, Yu Su, Huan Sun, Wen-tau Yih

As a promising paradigm, interactive semantic parsing has shown to improve both semantic parsing accuracy and user confidence in the results.

Semantic Parsing Text-To-SQL

Paper
Code

Unsupervised Question Decomposition for Question Answering

2 code implementations • EMNLP 2020 • Ethan Perez, Patrick Lewis, Wen-tau Yih, Kyunghyun Cho, Douwe Kiela

We aim to improve question answering (QA) by decomposing hard questions into simpler sub-questions that existing QA systems are capable of answering.

Question Answering

117

Paper
Code

Dense Passage Retrieval for Open-Domain Question Answering

17 code implementations • EMNLP 2020 • Vladimir Karpukhin, Barlas Oğuz, Sewon Min, Patrick Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, Wen-tau Yih

Open-domain question answering relies on efficient passage retrieval to select candidate contexts, where traditional sparse vector space models, such as TF-IDF or BM25, are the de facto method.

Ranked #1 on Question Answering on NaturalQA

Open-Domain Question Answering Passage Retrieval +1

124,593

Paper
Code

An Imitation Game for Learning Semantic Parsers from User Interaction

1 code implementation • EMNLP 2020 • Ziyu Yao, Yiqi Tang, Wen-tau Yih, Huan Sun, Yu Su

Despite the widely successful applications, bootstrapping and fine-tuning semantic parsers are still a tedious process with challenges such as costly data annotation and privacy risks.

Imitation Learning Text-To-SQL

Paper
Code

TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data

1 code implementation • ACL 2020 • Pengcheng Yin, Graham Neubig, Wen-tau Yih, Sebastian Riedel

Recent years have witnessed the burgeoning of pretrained language models (LMs) for text-based natural language (NL) understanding tasks.

Ranked #10 on Text-To-SQL on spider (Exact Match Accuracy (Dev) metric)

Semantic Parsing Text-To-SQL

578

Paper
Code

Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

9 code implementations • NeurIPS 2020 • Patrick Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela

Large pre-trained language models have been shown to store factual knowledge in their parameters, and achieve state-of-the-art results when fine-tuned on downstream NLP tasks.

Ranked #4 on Question Answering on WebQuestions

Fact Verification Question Answering +3

124,593

Paper
Code

Language Models as Fact Checkers?

no code implementations • WS 2020 • Nayeon Lee, Belinda Z. Li, Sinong Wang, Wen-tau Yih, Hao Ma, Madian Khabsa

Recent work has suggested that language models (LMs) store both common-sense and factual knowledge learned from pre-training data.

Common Sense Reasoning Language Modelling +2

Paper
Add Code

Open-Domain Question Answering

no code implementations • ACL 2020 • Danqi Chen, Wen-tau Yih

This tutorial provides a comprehensive and coherent overview of cutting-edge research in open-domain question answering (QA), the task of answering questions using a large collection of documents of diversified topics.

Open-Domain Question Answering

Paper
Add Code

Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval

1 code implementation • ICLR 2021 • Wenhan Xiong, Xiang Lorraine Li, Srini Iyer, Jingfei Du, Patrick Lewis, William Yang Wang, Yashar Mehdad, Wen-tau Yih, Sebastian Riedel, Douwe Kiela, Barlas Oğuz

We propose a simple and efficient multi-hop dense retrieval approach for answering complex open-domain questions, which achieves state-of-the-art performance on two multi-hop datasets, HotpotQA and multi-evidence FEVER.

Ranked #14 on Question Answering on HotpotQA

Question Answering Retrieval

208

Paper
Code

Efficient One-Pass End-to-End Entity Linking for Questions

3 code implementations • EMNLP 2020 • Belinda Z. Li, Sewon Min, Srinivasan Iyer, Yashar Mehdad, Wen-tau Yih

We present ELQ, a fast end-to-end entity linking model for questions, which uses a biencoder to jointly perform mention detection and linking in one pass.

Entity Linking Question Answering

1,127

Paper
Code

RECONSIDER: Re-Ranking using Span-Focused Cross-Attention for Open Domain Question Answering

1 code implementation • 21 Oct 2020 • Srinivasan Iyer, Sewon Min, Yashar Mehdad, Wen-tau Yih

State-of-the-art Machine Reading Comprehension (MRC) models for Open-domain Question Answering (QA) are typically trained for span selection using distantly supervised positive examples and heuristically retrieved negative examples.

Machine Reading Comprehension Natural Questions +3

Paper
Code

Joint Verification and Reranking for Open Fact Checking Over Tables

no code implementations • ACL 2021 • Michael Schlichtkrull, Vladimir Karpukhin, Barlas Oğuz, Mike Lewis, Wen-tau Yih, Sebastian Riedel

Structured information is an important knowledge source for automatic verification of factual claims.

Fact Checking Retrieval

Paper
Add Code

FiD-Ex: Improving Sequence-to-Sequence Models for Extractive Rationale Generation

no code implementations • EMNLP 2021 • Kushal Lakhotia, Bhargavi Paranjape, Asish Ghoshal, Wen-tau Yih, Yashar Mehdad, Srinivasan Iyer

Natural language (NL) explanations of model predictions are gaining popularity as a means to understand and verify decisions made by large black-box pre-trained models, for NLP tasks such as Question Answering (QA) and Fact Verification.

Fact Verification Question Answering +1

Paper
Add Code

Studying Strategically: Learning to Mask for Closed-book QA

no code implementations • 31 Dec 2020 • Qinyuan Ye, Belinda Z. Li, Sinong Wang, Benjamin Bolte, Hao Ma, Wen-tau Yih, Xiang Ren, Madian Khabsa

Thus, our policy packs task-relevant knowledge into the parameters of a language model.

Language Modelling Question Answering +1

Paper
Add Code

NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned

no code implementations • 1 Jan 2021 • Sewon Min, Jordan Boyd-Graber, Chris Alberti, Danqi Chen, Eunsol Choi, Michael Collins, Kelvin Guu, Hannaneh Hajishirzi, Kenton Lee, Jennimaria Palomaki, Colin Raffel, Adam Roberts, Tom Kwiatkowski, Patrick Lewis, Yuxiang Wu, Heinrich Küttler, Linqing Liu, Pasquale Minervini, Pontus Stenetorp, Sebastian Riedel, Sohee Yang, Minjoon Seo, Gautier Izacard, Fabio Petroni, Lucas Hosseini, Nicola De Cao, Edouard Grave, Ikuya Yamada, Sonse Shimaoka, Masatoshi Suzuki, Shumpei Miyawaki, Shun Sato, Ryo Takahashi, Jun Suzuki, Martin Fajcik, Martin Docekal, Karel Ondrej, Pavel Smrz, Hao Cheng, Yelong Shen, Xiaodong Liu, Pengcheng He, Weizhu Chen, Jianfeng Gao, Barlas Oguz, Xilun Chen, Vladimir Karpukhin, Stan Peshterliev, Dmytro Okhonko, Michael Schlichtkrull, Sonal Gupta, Yashar Mehdad, Wen-tau Yih

We review the EfficientQA competition from NeurIPS 2020.

Open-Domain Question Answering Retrieval

Paper
Add Code

Multi-task Retrieval for Knowledge-Intensive Tasks

no code implementations • ACL 2021 • Jean Maillard, Vladimir Karpukhin, Fabio Petroni, Wen-tau Yih, Barlas Oğuz, Veselin Stoyanov, Gargi Ghosh

Retrieving relevant contexts from a large corpus is a crucial step for tasks such as open-domain question answering and fact checking.

Fact Checking Open-Domain Question Answering +1

Paper
Add Code

On Unifying Misinformation Detection

1 code implementation • NAACL 2021 • Nayeon Lee, Belinda Z. Li, Sinong Wang, Pascale Fung, Hao Ma, Wen-tau Yih, Madian Khabsa

In this paper, we introduce UnifiedM2, a general-purpose misinformation model that jointly models multiple domains of misinformation with a single, unified setup.

Few-Shot Learning Misinformation

Paper
Code

On the Influence of Masking Policies in Intermediate Pre-training

no code implementations • EMNLP 2021 • Qinyuan Ye, Belinda Z. Li, Sinong Wang, Benjamin Bolte, Hao Ma, Wen-tau Yih, Xiang Ren, Madian Khabsa

Current NLP models are predominantly trained through a two-stage "pre-train then fine-tune" pipeline.

Abstractive Text Summarization Language Modelling +4

Paper
Add Code

RECONSIDER: Improved Re-Ranking using Span-Focused Cross-Attention for Open Domain Question Answering

no code implementations • NAACL 2021 • Srinivasan Iyer, Sewon Min, Yashar Mehdad, Wen-tau Yih

Machine Reading Comprehension Natural Questions +3

Paper
Add Code

On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized Study

1 code implementation • ACL 2021 • Divyansh Kaushik, Douwe Kiela, Zachary C. Lipton, Wen-tau Yih

In adversarial data collection (ADC), a human workforce interacts with a model in real time, attempting to produce examples that elicit incorrect predictions.

Question Answering

Paper
Code

Domain-matched Pre-training Tasks for Dense Retrieval

1 code implementation • Findings (NAACL) 2022 • Barlas Oğuz, Kushal Lakhotia, Anchit Gupta, Patrick Lewis, Vladimir Karpukhin, Aleksandra Piktus, Xilun Chen, Sebastian Riedel, Wen-tau Yih, Sonal Gupta, Yashar Mehdad

Pre-training on larger datasets with ever increasing model size is now a proven recipe for increased performance across almost all NLP tasks.

Ranked #2 on Passage Retrieval on Natural Questions (using extra training data)

Passage Retrieval Retrieval

245

Paper
Code

Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One?

2 code implementations • 13 Oct 2021 • Xilun Chen, Kushal Lakhotia, Barlas Oğuz, Anchit Gupta, Patrick Lewis, Stan Peshterliev, Yashar Mehdad, Sonal Gupta, Wen-tau Yih

Despite their recent popularity and well-known advantages, dense retrievers still lag behind sparse methods such as BM25 in their ability to reliably match salient phrases and rare entities in the query and to generalize to out-of-domain data.

Ranked #2 on Passage Retrieval on EntityQuestions

Open-Domain Question Answering Passage Retrieval +1

246

Paper
Code

UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning

1 code implementation • ACL 2022 • Yuning Mao, Lambert Mathias, Rui Hou, Amjad Almahairi, Hao Ma, Jiawei Han, Wen-tau Yih, Madian Khabsa

Recent parameter-efficient language model tuning (PELT) methods manage to match the performance of fine-tuning with much fewer trainable parameters and perform especially well when training data is limited.

Language Modelling Model Selection

Paper
Code

CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training

1 code implementation • Findings (NAACL) 2022 • Patrick Huber, Armen Aghajanyan, Barlas Oğuz, Dmytro Okhonko, Wen-tau Yih, Sonal Gupta, Xilun Chen

Consequently, we propose a novel QA dataset based on the Common Crawl project in this paper.

Open-Domain Question Answering

Paper
Code

Simple Local Attentions Remain Competitive for Long-Context Tasks

1 code implementation • NAACL 2022 • Wenhan Xiong, Barlas Oğuz, Anchit Gupta, Xilun Chen, Diana Liskovich, Omer Levy, Wen-tau Yih, Yashar Mehdad

Many NLP tasks require processing long contexts beyond the length limit of pretrained models.

29,201

Paper
Code

Boosted Dense Retriever

no code implementations • NAACL 2022 • Patrick Lewis, Barlas Oğuz, Wenhan Xiong, Fabio Petroni, Wen-tau Yih, Sebastian Riedel

DrBoost is trained in stages: each component model is learned sequentially and specialized by focusing only on retrieval mistakes made by the current ensemble.

Quantization Retrieval

Paper
Add Code

The Web Is Your Oyster -- Knowledge-Intensive NLP against a Very Large Web Corpus

2 code implementations • 18 Dec 2021 • Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Dmytro Okhonko, Samuel Broscheit, Gautier Izacard, Patrick Lewis, Barlas Oğuz, Edouard Grave, Wen-tau Yih, Sebastian Riedel

In order to address increasing demands of real-world applications, the research for knowledge-intensive NLP (KI-NLP) should advance by capturing the challenges of a truly open-domain environment: web-scale knowledge, lack of structure, inconsistent quality and noise.

Common Sense Reasoning Retrieval

550

Paper
Code

InCoder: A Generative Model for Code Infilling and Synthesis

3 code implementations • 12 Apr 2022 • Daniel Fried, Armen Aghajanyan, Jessy Lin, Sida Wang, Eric Wallace, Freda Shi, Ruiqi Zhong, Wen-tau Yih, Luke Zettlemoyer, Mike Lewis

Our model is the first generative model that is able to directly perform zero-shot code infilling, which we evaluate on challenging tasks such as type inference, comment generation, and variable re-naming.

Ranked #84 on Code Generation on MBPP

Code Generation Comment Generation +1

289

Paper
Code

Improving Passage Retrieval with Zero-Shot Question Generation

1 code implementation • 15 Apr 2022 • Devendra Singh Sachan, Mike Lewis, Mandar Joshi, Armen Aghajanyan, Wen-tau Yih, Joelle Pineau, Luke Zettlemoyer

We propose a simple and effective re-ranking method for improving passage retrieval in open question answering.

Language Modelling Open-Domain Question Answering +6

Paper
Code

DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings

1 code implementation • NAACL 2022 • Yung-Sung Chuang, Rumen Dangovski, Hongyin Luo, Yang Zhang, Shiyu Chang, Marin Soljačić, Shang-Wen Li, Wen-tau Yih, Yoon Kim, James Glass

We propose DiffCSE, an unsupervised contrastive learning framework for learning sentence embeddings.

Ranked #13 on Semantic Textual Similarity on STS16

Contrastive Learning Language Modelling +3

284

Paper
Code

Autoregressive Search Engines: Generating Substrings as Document Identifiers

2 code implementations • 22 Apr 2022 • Michele Bevilacqua, Giuseppe Ottaviano, Patrick Lewis, Wen-tau Yih, Sebastian Riedel, Fabio Petroni

Knowledge-intensive language tasks require NLP systems to both provide the correct answer and retrieve supporting evidence for it in a given corpus.

Information Retrieval Retrieval

269

Paper
Code

On Continual Model Refinement in Out-of-Distribution Data Streams

no code implementations • ACL 2022 • Bill Yuchen Lin, Sida Wang, Xi Victoria Lin, Robin Jia, Lin Xiao, Xiang Ren, Wen-tau Yih

Real-world natural language processing (NLP) models need to be continually updated to fix the prediction errors in out-of-distribution (OOD) data streams while overcoming catastrophic forgetting.

Benchmarking Continual Learning

Paper
Add Code

Structured Prompt Tuning

no code implementations • 24 May 2022 • Chi-Liang Liu, Hung-Yi Lee, Wen-tau Yih

We propose structured prompt tuning, a simple and effective method to improve prompt tuning.

Paper
Add Code

Adapting Pretrained Text-to-Text Models for Long Text Sequences

1 code implementation • 21 Sep 2022 • Wenhan Xiong, Anchit Gupta, Shubham Toshniwal, Yashar Mehdad, Wen-tau Yih

We present an empirical study of adapting an existing pretrained text-to-text model for long-sequence inputs.

Ranked #1 on Text Summarization on QMSum

Long-range modeling Question Answering +1

Paper
Code

RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering

1 code implementation • 25 Oct 2022 • Victor Zhong, Weijia Shi, Wen-tau Yih, Luke Zettlemoyer

Moreover, existing models are not robust to variations in question constraints, but can be made more robust by tuning on clusters of related questions.

Question Answering Retrieval

Paper
Code

Task-aware Retrieval with Instructions

1 code implementation • 16 Nov 2022 • Akari Asai, Timo Schick, Patrick Lewis, Xilun Chen, Gautier Izacard, Sebastian Riedel, Hannaneh Hajishirzi, Wen-tau Yih

We study the problem of retrieval with instructions, where users of a retrieval system explicitly describe their intent along with their queries.

Retrieval

146

Paper
Code

CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval

1 code implementation • 18 Nov 2022 • Minghan Li, Sheng-Chieh Lin, Barlas Oguz, Asish Ghoshal, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen

In this paper, we unify different multi-vector retrieval models from a token routing viewpoint and propose conditional token interaction via dynamic lexical routing, namely CITADEL, for efficient and effective multi-vector retrieval.

Retrieval

245

Paper
Code

Retrieval-Augmented Multimodal Language Modeling

no code implementations • 22 Nov 2022 • Michihiro Yasunaga, Armen Aghajanyan, Weijia Shi, Rich James, Jure Leskovec, Percy Liang, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih

To integrate knowledge in a more scalable and modular way, we propose a retrieval-augmented multimodal model, which enables a base multimodal model (generator) to refer to relevant text and images fetched by a retriever from external memory (e. g., documents on the web).

Ranked #7 on Image Captioning on MS COCO

Caption Generation Image Captioning +5

Paper
Add Code

Coder Reviewer Reranking for Code Generation

1 code implementation • 29 Nov 2022 • Tianyi Zhang, Tao Yu, Tatsunori B. Hashimoto, Mike Lewis, Wen-tau Yih, Daniel Fried, Sida I. Wang

Sampling diverse programs from a code language model and reranking with model likelihood is a popular method for code generation but it is prone to preferring degenerate solutions.

Ranked #22 on Code Generation on MBPP

Code Generation Language Modelling

Paper
Code

Nonparametric Masked Language Modeling

1 code implementation • 2 Dec 2022 • Sewon Min, Weijia Shi, Mike Lewis, Xilun Chen, Wen-tau Yih, Hannaneh Hajishirzi, Luke Zettlemoyer

Existing language models (LMs) predict tokens with a softmax over a finite vocabulary, which can make it difficult to predict rare tokens or phrases.

Language Modelling Masked Language Modeling +2

153

Paper
Code

One Embedder, Any Task: Instruction-Finetuned Text Embeddings

3 code implementations • 19 Dec 2022 • Hongjin Su, Weijia Shi, Jungo Kasai, Yizhong Wang, Yushi Hu, Mari Ostendorf, Wen-tau Yih, Noah A. Smith, Luke Zettlemoyer, Tao Yu

Our analysis suggests that INSTRUCTOR is robust to changes in instructions, and that instruction finetuning mitigates the challenge of training a single model on diverse datasets.

Information Retrieval Learning Word Embeddings +3

4,005

Paper
Code

REPLUG: Retrieval-Augmented Black-Box Language Models

1 code implementation • 30 Jan 2023 • Weijia Shi, Sewon Min, Michihiro Yasunaga, Minjoon Seo, Rich James, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih

We introduce REPLUG, a retrieval-augmented language modeling framework that treats the language model (LM) as a black box and augments it with a tuneable retrieval model.

Ranked #9 on Question Answering on Natural Questions

Language Modelling Multi-task Language Understanding +2

887

Paper
Code

How to Train Your DRAGON: Diverse Augmentation Towards Generalizable Dense Retrieval

1 code implementation • 15 Feb 2023 • Sheng-Chieh Lin, Akari Asai, Minghan Li, Barlas Oguz, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen

We hence propose a new DA approach with diverse queries and sources of supervision to progressively train a generalizable DR. As a result, DRAGON, our dense retriever trained with diverse augmentation, is the first BERT-base-sized DR to achieve state-of-the-art effectiveness in both supervised and zero-shot evaluations and even competes with models using more complex late interaction (ColBERTv2 and SPLADE++).

Contrastive Learning Data Augmentation +1

245

Paper
Code

LEVER: Learning to Verify Language-to-Code Generation with Execution

1 code implementation • 16 Feb 2023 • Ansong Ni, Srini Iyer, Dragomir Radev, Ves Stoyanov, Wen-tau Yih, Sida I. Wang, Xi Victoria Lin

The advent of large language models trained on code (code LLMs) has led to significant progress in language-to-code generation.

Ranked #2 on Semantic Parsing on spider

Arithmetic Reasoning Code Generation +3

Paper
Code

VideoOFA: Two-Stage Pre-Training for Video-to-Text Generation

no code implementations • 4 May 2023 • Xilun Chen, Lili Yu, Wenhan Xiong, Barlas Oğuz, Yashar Mehdad, Wen-tau Yih

We propose a new two-stage pre-training framework for video-to-text generation tasks such as video captioning and video question answering: A generative encoder-decoder model is first jointly pre-trained on massive image-text data to learn fundamental vision-language concepts, and then adapted to video data in an intermediate video-text pre-training stage to learn video-specific skills such as spatio-temporal reasoning.

Question Answering Text Generation +3

Paper
Add Code

Large Language Model Programs

no code implementations • 9 May 2023 • Imanol Schlag, Sainbayar Sukhbaatar, Asli Celikyilmaz, Wen-tau Yih, Jason Weston, Jürgen Schmidhuber, Xian Li

In recent years, large pre-trained language models (LLMs) have demonstrated the ability to follow instructions and perform novel tasks from a few examples.

Language Modelling Large Language Model +1

Paper
Add Code

Learning to Simulate Natural Language Feedback for Interactive Semantic Parsing

2 code implementations • 14 May 2023 • Hao Yan, Saurabh Srivastava, Yintao Tai, Sida I. Wang, Wen-tau Yih, Ziyu Yao

In this work, we propose a new task of simulating NL feedback for interactive semantic parsing.

Semantic Parsing Text-To-SQL

Paper
Code

FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation

4 code implementations • 23 May 2023 • Sewon Min, Kalpesh Krishna, Xinxi Lyu, Mike Lewis, Wen-tau Yih, Pang Wei Koh, Mohit Iyyer, Luke Zettlemoyer, Hannaneh Hajishirzi

Evaluating the factuality of long-form text generated by large language models (LMs) is non-trivial because (1) generations often contain a mixture of supported and unsupported pieces of information, making binary judgments of quality inadequate, and (2) human evaluation is time-consuming and costly.

Language Modelling Retrieval +1

265

Paper
Code

Few-Shot Data Synthesis for Open Domain Multi-Hop Question Answering

no code implementations • 23 May 2023 • Mingda Chen, Xilun Chen, Wen-tau Yih

Few-shot learning for open domain multi-hop question answering typically relies on the incontext learning capability of large language models (LLMs).

Fact Verification Few-Shot Learning +2

Paper
Add Code

Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering

1 code implementation • 26 May 2023 • Yung-Sung Chuang, Wei Fang, Shang-Wen Li, Wen-tau Yih, James Glass

We propose EAR, a query Expansion And Reranking approach for improving passage retrieval, with the application to open-domain question answering.

Open-Domain Question Answering Passage Retrieval +1

Paper
Code

Instruction-tuned Language Models are Better Knowledge Learners

no code implementations • 20 Feb 2024 • Zhengbao Jiang, Zhiqing Sun, Weijia Shi, Pedro Rodriguez, Chunting Zhou, Graham Neubig, Xi Victoria Lin, Wen-tau Yih, Srinivasan Iyer

The standard recipe for doing so involves continued pre-training on new documents followed by instruction-tuning on question-answer (QA) pairs.

Language Modelling Large Language Model

Paper
Add Code

Reliable, Adaptable, and Attributable Language Models with Retrieval

no code implementations • 5 Mar 2024 • Akari Asai, Zexuan Zhong, Danqi Chen, Pang Wei Koh, Luke Zettlemoyer, Hannaneh Hajishirzi, Wen-tau Yih

Parametric language models (LMs), which are trained on vast amounts of web data, exhibit remarkable flexibility and capability.

Question Answering Retrieval

Paper
Add Code

Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

1 code implementation • 12 Mar 2024 • Sainbayar Sukhbaatar, Olga Golovneva, Vasu Sharma, Hu Xu, Xi Victoria Lin, Baptiste Rozière, Jacob Kahn, Daniel Li, Wen-tau Yih, Jason Weston, Xian Li

We investigate efficient methods for training Large Language Models (LLMs) to possess capabilities in multiple specialized domains, such as coding, math reasoning and world knowledge.

Ranked #30 on Question Answering on TriviaQA

Arithmetic Reasoning Code Generation +6

176

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.