ConvAI2 Dataset | Papers With Code

Name:*

Full name (optional):

Description (Markdown and $\LaTeX$ enabled):*

The **ConvAI2** NeurIPS competition aimed at finding approaches to creating high-quality dialogue agents capable of meaningful open domain conversation. The ConvAI2 dataset for training models is based on the PERSONA-CHAT dataset. The speaker pairs each have assigned profiles coming from a set of 1155 possible personas (at training time), each consisting of at least 5 profile sentences, setting aside 100 never seen before personas for validation. As the original PERSONA-CHAT test set was released, a new hidden test set consisted of 100 new personas and over 1,015 dialogs was created by crowdsourced workers.

To avoid modeling that takes advantage of trivial word overlap, additional rewritten sets of the same train and test personas were crowdsourced, with related sentences that are rephrases, generalizations or specializations, rendering the task much more challenging. For example “I just got my nails done” is revised as “I love to pamper myself on a regular basis” and “I am on a diet now” is revised as “I need to lose weight.”

The training, validation and hidden test sets consists of 17,878, 1,000 and 1,015 dialogues, respectively.

Source: [The Second Conversational Intelligence Challenge (ConvAI2)](https://paperswithcode.com/paper/the-second-conversational-intelligence/)
Image Source: [The Second Conversational Intelligence Challenge (ConvAI2)](https://paperswithcode.com/paper/the-second-conversational-intelligence/)

Homepage URL (optional):

Paper where the dataset was introduced:

Introduction date:

Dataset license:

URL to full license terms:

Image

Currently

datasets/ConvAI2-0000003063-95759eda.jpg Clear

Change

---

ConvAI2 (Conversational Intelligence Challenge 2)

Benchmarks

Add a new result Link an existing benchmark

Papers

Dataset Loaders

Add Remove

Tasks

Similar Datasets

DailyDialog++

DailyDialog

USR-PersonaChat

FED

Usage

License

Modalities

Languages

ConvAI2 (Conversational Intelligence Challenge 2)

Benchmarks Edit Add a new result Link an existing benchmark