20 papers with code • 1 benchmark • 2 datasets
Large end-to-end neural open-domain chatbots are becoming increasingly popular.
ProphetNet is a pre-training-based natural language generation method that shows strong performance on English text summarization and question generation tasks.
Open-domain dialog generation is a challenging problem: maximum likelihood training can lead to repetitive outputs, models have difficulty tracking long-term conversational goals, and training on standard movie or online datasets can produce inappropriate, biased, or offensive text.
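The repetition problem mentioned above can be illustrated with a toy sketch (not drawn from any of the cited papers): greedily decoding the single most likely next token from a small hand-written bigram table, the way a maximum-likelihood-trained model's argmax decoding does, can fall into a loop and repeat the same phrase indefinitely.

```python
def greedy_decode(start, bigram, steps=12):
    """Always pick the single most likely next token (argmax decoding)."""
    out = [start]
    for _ in range(steps):
        next_probs = bigram.get(out[-1])
        if not next_probs:
            break  # no continuation known for this token
        out.append(max(next_probs, key=next_probs.get))
    return out

# Hypothetical bigram probabilities; the argmax transitions form a cycle.
bigram = {
    "i":    {"am": 0.6, "think": 0.4},
    "am":   {"not": 0.7, "here": 0.3},
    "not":  {"sure": 0.8, "now": 0.2},
    "sure": {"i": 0.9, ".": 0.1},
}

print(" ".join(greedy_decode("i", bigram)))
# → i am not sure i am not sure i am not sure i
```

Sampling from the distribution (or penalizing repeated n-grams) instead of always taking the argmax is one common way to break such loops.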
Most deep reinforcement learning (RL) systems are not able to learn effectively from off-policy data, especially if they cannot explore online in the environment.
To investigate the strengths of this novel metric and of interactive evaluation, compared with state-of-the-art metrics and human evaluation of static conversations, we perform extended experiments with a set of models, including several that improve on recent hierarchical dialog generation architectures through utterance-level sentiment and semantic knowledge distillation.
The lack of meaningful automatic evaluation metrics for dialog has impeded open-domain dialog research.
The aim of this paper is to mitigate the shortcomings of automatic evaluation of open-domain dialog systems through multi-reference evaluation.
Open-domain human-computer conversation has been attracting increasing attention over the past few years.