| Training Techniques | AdamW |
| --- | --- |
| Architecture | BERT, Dropout, Feedforward Network, Layer Normalization, Linear Layer, ReLU, Sigmoid, Tanh |
| LR | 0.0003 |
The model works by first computing an embedded representation of each span in the document. These span representations are scored and used to prune away spans that are unlikely to occur in a coreference cluster. For each remaining span, the model decides which antecedent span (if any) it is coreferent with. The resulting coreference links, after applying transitivity, imply a clustering of the spans in the document. The GloVe embeddings used in the original paper have been substituted with SpanBERT embeddings.
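The last step above — turning pairwise antecedent links into clusters via transitivity — can be sketched with a small union-find routine. This is an illustrative sketch, not AllenNLP's implementation; the spans and links below are hypothetical `(start, end)` token indices.

```python
def clusters_from_links(links):
    """Group spans into clusters given (mention, antecedent) link pairs.

    Transitivity is enforced by union-find: if A links to B and C links
    to A, then A, B, and C all end up in one cluster.
    """
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    def union(a, b):
        parent[find(a)] = find(b)

    for mention, antecedent in links:
        union(mention, antecedent)

    groups = {}
    for m in parent:
        groups.setdefault(find(m), set()).add(m)
    return [sorted(g) for g in groups.values()]

# Hypothetical links: span (10, 10) points back to (0, 1), and
# span (15, 16) points back to (10, 10).
links = [((10, 10), (0, 1)), ((15, 16), (10, 10))]
print(clusters_from_links(links))
# prints: [[(0, 1), (10, 10), (15, 16)]]
```

Because each mention links to at most one antecedent, the links form a forest, and union-find recovers the connected components (the clusters) in near-linear time.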
Explore the live Coreference Resolution demo at AllenNLP.
from allennlp_models.pretrained import load_predictor
predictor = load_predictor("coref-spanbert")
print(predictor.coref_resolved("The trophy doesn't fit in the brown suitcase because it is too big."))
# prints: The trophy doesn't fit in the brown suitcase because The trophy is too big.
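Under the hood, resolution amounts to replacing each non-representative mention with the text of its cluster's first mention. The following is a simplified, self-contained sketch of that substitution step (not AllenNLP's actual `coref_resolved`, which operates on spaCy tokens); the token indices and cluster below are hand-constructed for illustration.

```python
def resolve(tokens, clusters):
    """Rewrite tokens so that every later mention in a cluster is
    replaced by the text of the cluster's first (representative) mention."""
    replacements = {}  # mention start index -> (end index, replacement tokens)
    for cluster in clusters:
        first_start, first_end = cluster[0]
        representative = tokens[first_start:first_end + 1]
        for start, end in cluster[1:]:
            replacements[start] = (end, representative)

    out, i = [], 0
    while i < len(tokens):
        if i in replacements:
            end, representative = replacements[i]
            out.extend(representative)  # swap in the representative mention
            i = end + 1
        else:
            out.append(tokens[i])
            i += 1
    return " ".join(out)

tokens = "The trophy does n't fit in the brown suitcase because it is too big .".split()
# One cluster: "The trophy" (tokens 0-1) and "it" (token 10).
clusters = [[(0, 1), (10, 10)]]
print(resolve(tokens, clusters))
# prints: The trophy does n't fit in the brown suitcase because The trophy is too big .
```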
You can also get predictions using the allennlp command-line interface:
echo '{"sentence": "The trophy doesn'\''t fit in the brown suitcase because it is too big."}' | \
allennlp predict https://storage.googleapis.com/allennlp-public-models/coref-spanbert-large-2020.02.27.tar.gz -
To train this model, you can use the allennlp CLI tool and the configuration file coref_spanbert_large.jsonnet:
allennlp train coref_spanbert_large.jsonnet -s output_dir
See the AllenNLP Training and prediction guide for more details.
@inproceedings{Lee2018HigherorderCR,
  author = {Kenton Lee and Luheng He and Luke Zettlemoyer},
  booktitle = {NAACL-HLT},
  title = {Higher-order Coreference Resolution with Coarse-to-fine Inference},
  year = {2018}
}