BERTScore: Evaluating Text Generation with BERT

ICLR 2020 Tianyi ZhangVarsha KishoreFelix WuKilian Q. WeinbergerYoav Artzi

We propose BERTScore, an automatic evaluation metric for text generation. Analogously to common metrics, BERTScore computes a similarity score for each token in the candidate sentence with each token in the reference sentence... (read more)

PDF Abstract

Results from the Paper


  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.