PAQ (Probably Asked Questions)

Introduced by Lewis et al. in PAQ: 65 Million Probably-Asked Questions and What You Can Do With Them

Probably Asked Questions (PAQ) is a very large resource of 65M automatically-generated QA-pairs. PAQ is a semi-structured Knowledge Base (KB) of 65M natural language QA-pairs, which models can memorise and/or learn to retrieve from. PAQ differs from traditional KBs in that questions and answers are stored in natural language, and that questions are generated such that they are likely to appear in ODQA datasets. PAQ is automatically constructed using a question generation model and Wikipedia.

Source: Lewis et al.


Paper Code Results Date Stars

Dataset Loaders


Similar Datasets

Source: Lewis et al..


  • Unknown