Improved Deep Learning Baselines for Ubuntu Corpus Dialogs

13 Oct 2015  ·  Rudolf Kadlec, Martin Schmid, Jan Kleindienst ·

This paper presents results of our experiments for the next utterance ranking on the Ubuntu Dialog Corpus -- the largest publicly available multi-turn dialog corpus. First, we use an in-house implementation of previously reported models to do an independent evaluation using the same data. Second, we evaluate the performances of various LSTMs, Bi-LSTMs and CNNs on the dataset. Third, we create an ensemble by averaging predictions of multiple models. The ensemble further improves the performance and it achieves a state-of-the-art result for the next utterance ranking on this dataset. Finally, we discuss our future plans using this corpus.

PDF Abstract

Datasets


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Conversational Response Selection Ubuntu Dialogue (v1, Ranking) Dual-BiLSTM R10@1 0.630 # 24
R10@2 0.780 # 23
R10@5 0.944 # 23
R2@1 0.895 # 12

Methods


No methods listed for this paper. Add relevant methods here