Named Entity Recognition for Entity Linking: What Works and What’s Next

Entity Linking (EL) systems have achieved impressive results on standard benchmarks mainly thanks to the contextualized representations provided by recent pretrained language models. However, such systems still require massive amounts of data – millions of labeled examples – to perform at their best, with training times that often exceed several days, especially when limited computational resources are available. In this paper, we look at how Named Entity Recognition (NER) can be exploited to narrow the gap between EL systems trained on high and low amounts of labeled data. More specifically, we show how and to what extent an EL system can benefit from NER to enhance its entity representations, improve candidate selection, select more effective negative samples and enforce hard and soft constraints on its output entities. We release our software – code and model checkpoints – at https://github.com/Babelscape/ner4el.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Entity Disambiguation ACE2004 NER4EL Micro-F1 91.3 # 4
Entity Disambiguation AIDA-CoNLL NER4EL In-KB Accuracy 92.5 # 13
Entity Disambiguation AQUAINT NER4EL Micro-F1 69.5 # 6
Entity Disambiguation MSNBC NER4EL Micro-F1 89.2 # 6
Entity Disambiguation WNED-CWEB NER4EL Micro-F1 68.5 # 7
Entity Disambiguation WNED-WIKI NER4EL Micro-F1 64.0 # 7

Methods