DailyDialog++

Introduced by Sai et al. in Improving Dialog Evaluation with a Multi-reference Adversarial Dataset and Large Scale Pretraining

Consists of (i) five relevant responses for each context and (ii) five adversarially crafted irrelevant responses for each context.

Source: Improving Dialog Evaluation with a Multi-reference Adversarial Dataset and Large Scale Pretraining

Homepage

No benchmarks yet. Start a new benchmark or link an existing one.

Paper	Code	Results	Date	Stars

ConvAI2