Reading Comprehension

568 papers with code • 7 benchmarks • 95 datasets

Most current question answering datasets frame the task as reading comprehension where the question is about a paragraph or document and the answer often is a span in the document.

Some specific tasks of reading comprehension include multi-modal machine reading comprehension and textual machine reading comprehension, among others. In the literature, machine reading comprehension can be divide into four categories: cloze style, multiple choice, span prediction, and free-form answer. Read more about each category here.

Benchmark datasets used for testing a model's reading comprehension abilities include MovieQA, ReCoRD, and RACE, among others.

The Machine Reading group at UCL also provides an overview of reading comprehension tasks.

Figure source: A Survey on Machine Reading Comprehension: Tasks, Evaluation Metrics and Benchmark Datasets

Benchmarks

Add a Result

These leaderboards are used to track progress in Reading Comprehension

Dataset	Best Model	Compare
RACE	ALBERT (Ensemble)	See all
ReClor	Rational Reasoner / IDOL	See all
MuSeRC	Golden Transformer	See all
AdversarialQA	RoBERTa-Large	See all
CrowdSource QA	BERT	See all
ReCAM	NAL	See all
RadQA	BERT pretrained on MIMIC-III	See all

Libraries

Use these libraries to find Reading Comprehension models and implementations

huggingface/transformers

7 papers

124,889

facebookresearch/ParlAI

4 papers

10,426

baidu/DuReader

4 papers

1,102

NVIDIA/Megatron-LM

2 papers

8,533

See all 6 libraries.

Datasets

Subtasks

LAMBADA

Question Selection

Multi-Hop Reading Comprehension

Implicatures

Logical Reasoning Reading Comprehension

English Proverbs

Fantasy Reasoning

Figure Of Speech Detection

Formal Fallacies Syllogisms Negation

GRE Reading Comprehension

Hyperbaton

Movie Dialog Same Or Different

Nonsense Words Grammar

Phrase Relatedness

RACE-h

RACE-m

Most implemented papers

Most implemented Social Latest No code

RoBERTa: A Robustly Optimized BERT Pretraining Approach

pytorch/fairseq • • 26 Jul 2019

Language model pretraining has led to significant performance gains but careful comparison between different approaches is challenging.

Paper
Code

Language Models are Few-Shot Learners

openai/gpt-3 • NeurIPS 2020

By contrast, humans can generally perform a new language task from only a few examples or from simple instructions - something which current NLP systems still largely struggle to do.

Paper
Code

Listen, Attend and Spell

Alexander-H-Liu/End-to-end-ASR-Pytorch • • 5 Aug 2015

Unlike traditional DNN-HMM models, this model learns all the components of a speech recognizer jointly.

Paper
Code

Bidirectional Attention Flow for Machine Comprehension

allenai/bi-att-flow • • 5 Nov 2016

Machine comprehension (MC), answering a query about a given context paragraph, requires modeling complex interactions between the context and the query.

Paper
Code

XLNet: Generalized Autoregressive Pretraining for Language Understanding

zihangdai/xlnet • • NeurIPS 2019

With the capability of modeling bidirectional contexts, denoising autoencoding based pretraining like BERT achieves better performance than pretraining approaches based on autoregressive language modeling.

Paper
Code

Towards AI-Complete Question Answering: A Set of Prerequisite Toy Tasks

facebook/bAbI-tasks • • 19 Feb 2015

One long-term goal of machine learning research is to produce methods that are applicable to reasoning and natural language, in particular building an intelligent dialogue agent.

Paper
Code

SQuAD: 100,000+ Questions for Machine Comprehension of Text

worksheets/0xd53d03a4 • EMNLP 2016

We present the Stanford Question Answering Dataset (SQuAD), a new reading comprehension dataset consisting of 100, 000+ questions posed by crowdworkers on a set of Wikipedia articles, where the answer to each question is a segment of text from the corresponding reading passage.

Paper
Code

QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension

BangLiu/QANet-PyTorch • • ICLR 2018

On the SQuAD dataset, our model is 3x to 13x faster in training and 4x to 9x faster in inference, while achieving equivalent accuracy to recurrent models.

Paper
Code

Language Models are Unsupervised Multitask Learners

openai/gpt-2 • • Preprint 2019

Natural language processing tasks, such as question answering, machine translation, reading comprehension, and summarization, are typically approached with supervised learning on taskspecific datasets.

Paper
Code

MS MARCO: A Human Generated MAchine Reading COmprehension Dataset

AmenRa/rank_eval • 28 Nov 2016

The size of the dataset and the fact that the questions are derived from real user search queries distinguishes MS MARCO from other well-known publicly available datasets for machine reading comprehension and question-answering.

Paper
Code

Reading Comprehension

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result