Reducing Sentiment Bias in Language Models via Counterfactual Evaluation

ICLR 2020 Po-Sen HuangHuan ZhangRay JiangRobert StanforthJohannes WelblJack RaeVishal MainiDani YogatamaPushmeet Kohli

Recent advances in language model architectures and the availability of large text corpora have driven progress on automatic text generation. While this results in models that are capable of generating coherent texts, it also prompts models to internalize social biases present in the training corpus... (read more)

PDF Abstract

Results from the Paper

  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.