1 code implementation • LREC 2022 • Thórhildur Thorleiksdóttir, Cedric Renggli, Nora Hollenstein, Ce Zhang
Collecting human judgements is currently the most reliable evaluation method for natural language generation systems.
Text Generation