TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Coreference Resolution	OntoNotes	Reward Rescaling	F1	65.73	# 23

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/deep-reinforcement-learning-for-mention/coreference-resolution-on-ontonotes)](https://paperswithcode.com/sota/coreference-resolution-on-ontonotes?p=deep-reinforcement-learning-for-mention)`

Deep Reinforcement Learning for Mention-Ranking Coreference Models

EMNLP 2016 · Kevin Clark, Christopher D. Manning ·

Coreference resolution systems are typically trained with heuristic loss functions that require careful tuning. In this paper we instead apply reinforcement learning to directly optimize a neural mention-ranking model for coreference evaluation metrics. We experiment with two approaches: the REINFORCE policy gradient algorithm and a reward-rescaled max-margin objective. We find the latter to be more effective, resulting in significant improvements over the current state-of-the-art on the English and Chinese portions of the CoNLL 2012 Shared Task.