Evaluating and Explaining Natural Language Generation with GenX

NAACL (DaSH) 2021 · Kayla Duskin, Shivam Sharma, Ji Young Yun, Emily Saldanha, Dustin Arendt

Current methods for evaluating natural language generation models focus on measuring text quality but fail to probe model creativity, i.e., the model's ability to generate novel but coherent text sequences not seen in the training corpus. We present GenX, a tool designed to enable interactive exploration and explanation of natural language generation outputs, with a focus on detecting memorization. We demonstrate the utility of the tool on two domain-conditioned generation use cases: phishing emails and ACL abstracts.
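
To make the notion of memorization concrete, the sketch below measures how much of a generated text is copied verbatim from a training corpus using n-gram overlap. This is an illustrative assumption, not the method GenX itself uses: the function names `ngrams` and `memorized_fraction`, the whitespace tokenization, and the default `n = 4` are all hypothetical choices made for this example.

```python
from typing import Iterable, List, Set, Tuple


def ngrams(tokens: List[str], n: int) -> Set[Tuple[str, ...]]:
    """Return the set of contiguous n-grams in a token list."""
    # range() is empty when the text has fewer than n tokens.
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def memorized_fraction(generated: str, corpus: Iterable[str], n: int = 4) -> float:
    """Fraction of the generated text's n-grams found verbatim in the corpus.

    Hypothetical helper: uses lowercased whitespace tokens, which is a
    simplification chosen for illustration only.
    """
    corpus_ngrams: Set[Tuple[str, ...]] = set()
    for doc in corpus:
        corpus_ngrams |= ngrams(doc.lower().split(), n)

    gen_ngrams = ngrams(generated.lower().split(), n)
    if not gen_ngrams:
        # Text shorter than n tokens has no n-grams to compare.
        return 0.0
    return len(gen_ngrams & corpus_ngrams) / len(gen_ngrams)
```

A value near 1.0 suggests the output is largely copied from the training data, while a value near 0.0 suggests novel text. Overlap alone says nothing about coherence, so a score like this would complement rather than replace text-quality metrics.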
