WebSRC (WebSRC: A Dataset for Web-Based Structural Reading Comprehension)

Introduced by Chen et al. in WebSRC: A Dataset for Web-Based Structural Reading Comprehension

WebSRC is a Web-based Structural Reading Comprehension dataset. It consists of 0.44M question-answer pairs collected from 6.5K web pages, each provided with its HTML source code, a screenshot and metadata. Answering a question in WebSRC requires some understanding of the structure of the web page, and the answer is either a text span on the page or yes/no.
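To make the two answer types concrete, here is a minimal sketch in Python. The `QAExample` record, its field names, the helper functions and the sample page are all hypothetical illustrations, not the official WebSRC schema or loader; the sketch only assumes that each example pairs a page's HTML with a question and an answer that is either a span of the page text or yes/no.

```python
from dataclasses import dataclass
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects the non-empty text nodes of an HTML document."""

    def __init__(self):
        super().__init__()
        self.chunks = []

    def handle_data(self, data):
        # Keep only text nodes that contain something besides whitespace.
        if data.strip():
            self.chunks.append(data.strip())


def page_text(html: str) -> str:
    """Return the page's text nodes joined by single spaces."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


@dataclass
class QAExample:
    """Hypothetical record layout (an assumption, not the dataset's schema)."""
    html: str
    question: str
    answer: str


def answer_kind(example: QAExample) -> str:
    """Classify an answer as 'yes/no', 'span', or 'invalid'."""
    if example.answer.lower() in ("yes", "no"):
        return "yes/no"
    if example.answer in page_text(example.html):
        return "span"
    return "invalid"


# Made-up page: the question can only be answered by reading the table layout.
PAGE = (
    "<table>"
    "<tr><th>Model</th><td>X100</td></tr>"
    "<tr><th>Price</th><td>$250</td></tr>"
    "</table>"
)
span_q = QAExample(PAGE, "What is the price of the X100?", "$250")
bool_q = QAExample(PAGE, "Is a price listed for the X100?", "Yes")

print(answer_kind(span_q), answer_kind(bool_q))
```

The span check here is a plain substring test on the extracted text, chosen for brevity; it is not how the dataset itself defines or validates answers.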

Source: WebSRC Homepage

