VisualMRC (VisualMRC: Machine Reading Comprehension on Document Images)

Introduced by Tanaka et al. in VisualMRC: Machine Reading Comprehension on Document Images

VisualMRC is a visual machine reading comprehension dataset that proposes a task: given a question and a document image, a model produces an abstractive answer.

You can find more details, analyses, and baseline results in the paper, VisualMRC: Machine Reading Comprehension on Document Images, AAAI 2021.

Statistics: 10,197 images 30,562 QA pairs 10.53 average question tokens (tokenizing with NLTK tokenizer) 9.53 average answer tokens (tokenizing wit NLTK tokenizer) 151.46 average OCR tokens (tokenizing with NLTK tokenizer)

Homepage

Benchmarks

Add a new result Link an existing benchmark

No benchmarks yet. Start a new benchmark or link an existing one.

Papers

Paper	Code	Results	Date	Stars

Dataset Loaders

Add Remove

nttmdlab-nlp/VisualMRC

Tasks

Similar Datasets

DUDE

TextCaps

DocCVQA

InfographicVQA

Usage

License

Unknown

Modalities

Images
Texts

Languages

English