Coreference Resolution

258 papers with code • 16 benchmarks • 43 datasets

Coreference resolution is the task of clustering mentions in text that refer to the same underlying real world entities.

Example:

               +-----------+
               |           |
I voted for Obama because he was most aligned with my values", she said.
 |                                                 |            |
 +-------------------------------------------------+------------+

"I", "my", and "she" belong to the same cluster and "Obama" and "he" belong to the same cluster.

Benchmarks

Add a Result

These leaderboards are used to track progress in Coreference Resolution

Dataset	Best Model	Compare
Winograd Schema Challenge	PaLM 540B (fine-tuned)	See all
OntoNotes	seq2seq	See all
CoNLL 2012	seq2seq	See all
GAP	Coref-MTL	See all
DWIE	REXEL	See all
STM-coref	BFCR + SpanBERT + Transfer Learning	See all
OntoGUM	MTL-coref	See all
XWinograd EN	mT0-13B	See all
XWinograd FR	mT0-13B	See all
CoNLL12	DeepStruct multi-task w/ finetune	See all
WikiCoref	longdoc S (joint training + PS 30k)	See all
The ARRAU Corpus	dali-full-anaphora	See all
LitBank	longdoc S (joint training)	See all
PreCo	longdoc S (joint training)	See all
Quizbowl	longdoc S (joint training)	See all
DocRED-IE	REXEL	See all

Show all 16 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Coreference Resolution models and implementations

huggingface/transformers

5 papers

124,984

shreyashankar/gpt3-sandbox

3 papers

2,903

dmlc/gluon-nlp

3 papers

2,548

volcengine/vegiantmodel

3 papers

197

See all 25 libraries.

Datasets

Subtasks

Most implemented papers

Most implemented Social Latest No code

Attention Is All You Need

tensorflow/tensor2tensor • • NeurIPS 2017

The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration.

567

Paper
Code

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

google-research/bert • • NAACL 2019

We introduce a new language representation model called BERT, which stands for Bidirectional Encoder Representations from Transformers.

528

Paper
Code

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

huggingface/transformers • • arXiv 2019

Transfer learning, where a model is first pre-trained on a data-rich task before being fine-tuned on a downstream task, has emerged as a powerful technique in natural language processing (NLP).

Paper
Code

Language Models are Few-Shot Learners

openai/gpt-3 • NeurIPS 2020

By contrast, humans can generally perform a new language task from only a few examples or from simple instructions - something which current NLP systems still largely struggle to do.

Paper
Code

Deep contextualized word representations

flairNLP/flair • • NAACL 2018

We introduce a new type of deep contextualized word representation that models both (1) complex characteristics of word use (e. g., syntax and semantics), and (2) how these uses vary across linguistic contexts (i. e., to model polysemy).

Paper
Code

Language Models are Unsupervised Multitask Learners

openai/gpt-2 • • Preprint 2019

Natural language processing tasks, such as question answering, machine translation, reading comprehension, and summarization, are typically approached with supervised learning on taskspecific datasets.

Paper
Code

DeBERTa: Decoding-enhanced BERT with Disentangled Attention

microsoft/DeBERTa • • ICLR 2021

Recent progress in pre-trained neural language models has significantly improved the performance of many natural language processing (NLP) tasks.

Paper
Code

SDNet: Contextualized Attention-based Deep Network for Conversational Question Answering

Microsoft/SDNet • • 10 Dec 2018

Conversational question answering (CQA) is a novel QA task that requires understanding of dialogue context.

Paper
Code

SpanBERT: Improving Pre-training by Representing and Predicting Spans

facebookresearch/SpanBERT • • TACL 2020

We present SpanBERT, a pre-training method that is designed to better represent and predict spans of text.

Paper
Code

Scaling Instruction-Finetuned Language Models

google-research/flan • • 20 Oct 2022

We find that instruction finetuning with the above aspects dramatically improves performance on a variety of model classes (PaLM, T5, U-PaLM), prompting setups (zero-shot, few-shot, CoT), and evaluation benchmarks (MMLU, BBH, TyDiQA, MGSM, open-ended generation).

Paper
Code

Coreference Resolution

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Most implemented papers

Content

Benchmarks

Add a Result