1 code implementation • Findings (NAACL) 2022 • Eileen Wang, Caren Han, Josiah Poon
We measure the reliability of our metric sets by analysing its correlation with human judgement scores on a sample of machine stories obtained from 4 state-of-the-arts models trained on the Visual Storytelling Dataset (VIST).
1 code implementation • 8 May 2022 • Eileen Wang, Caren Han, Josiah Poon
We measure the reliability of our metric sets by analysing its correlation with human judgement scores on a sample of machine stories obtained from 4 state-of-the-arts models trained on the Visual Storytelling Dataset (VIST).