Named Entity Recognition (NER)
926 papers with code • 76 benchmarks • 127 datasets
Named Entity Recognition (NER) is a Natural Language Processing (NLP) task that involves identifying and classifying named entities in text into predefined categories such as person names, organizations, and locations. The goal of NER is to extract structured information from unstructured text and represent it in a machine-readable format. Approaches typically use BIO notation, which marks the beginning (B) and the inside (I) of entities, with O used for non-entity tokens.
Example:
| Mark | Watney | visited | Mars |
|---|---|---|---|
| B-PER | I-PER | O | B-LOC |
(Image credit: Zalando)
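A minimal sketch of how token-level BIO tags can be derived from entity spans, matching the example above (pure Python; the helper name and the span format are illustrative assumptions, not taken from any specific library):

```python
def spans_to_bio(tokens, spans):
    """Convert entity spans over token indices into BIO tags.

    tokens: list of token strings, e.g. ["Mark", "Watney", "visited", "Mars"]
    spans:  list of (start, end, label) with end exclusive over token indices,
            e.g. [(0, 2, "PER"), (3, 4, "LOC")]  # illustrative span format
    """
    tags = ["O"] * len(tokens)          # non-entity tokens default to O
    for start, end, label in spans:
        tags[start] = f"B-{label}"      # first token of the entity
        for i in range(start + 1, end):
            tags[i] = f"I-{label}"      # remaining tokens inside the entity
    return tags


tokens = ["Mark", "Watney", "visited", "Mars"]
spans = [(0, 2, "PER"), (3, 4, "LOC")]
print(list(zip(tokens, spans_to_bio(tokens, spans))))
# [('Mark', 'B-PER'), ('Watney', 'I-PER'), ('visited', 'O'), ('Mars', 'B-LOC')]
```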
Libraries
Use these libraries to find Named Entity Recognition (NER) models and implementations; a short usage sketch follows the subtask list below.
Subtasks
- NER
- Nested Named Entity Recognition
- Chinese Named Entity Recognition
- Few-shot NER
- Multilingual Named Entity Recognition
- Medical Named Entity Recognition
- Cross-Domain Named Entity Recognition
- Multi-modal Named Entity Recognition
- Named Entity Recognition In Vietnamese
- Zero-shot Named Entity Recognition (NER)
- Toponym Recognition
- Scientific Concept Extraction
- Multi-Grained Named Entity Recognition
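As a usage sketch for the library note above: a hedged example of running inference with a pretrained NER model through the Hugging Face transformers pipeline API (the checkpoint name dslim/bert-base-NER is an assumption; any token-classification model from the hub can be substituted):

```python
from transformers import pipeline

# Token-classification (NER) pipeline; aggregation_strategy="simple" merges
# B-/I- pieces of the same entity into a single span.
ner = pipeline(
    "token-classification",
    model="dslim/bert-base-NER",       # assumed checkpoint, swap for any NER model
    aggregation_strategy="simple",
)

for entity in ner("Mark Watney visited Mars"):
    print(entity["entity_group"], entity["word"], round(entity["score"], 3))
# Expected output (approximately): PER "Mark Watney", LOC "Mars"
```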
Most implemented papers
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
We introduce a new language representation model called BERT, which stands for Bidirectional Encoder Representations from Transformers.
Deep contextualized word representations
We introduce a new type of deep contextualized word representation that models both (1) complex characteristics of word use (e.g., syntax and semantics), and (2) how these uses vary across linguistic contexts (i.e., to model polysemy).
Neural Architectures for Named Entity Recognition
State-of-the-art named entity recognition systems rely heavily on hand-crafted features and domain-specific knowledge in order to learn effectively from the small, supervised training corpora that are available.
Bidirectional LSTM-CRF Models for Sequence Tagging
The model can also use sentence-level tag information thanks to a CRF layer.
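A minimal PyTorch sketch of the BiLSTM-CRF idea (illustrative, not the paper's implementation; the class name, hyperparameters, and the simplified decode-only CRF are assumptions): the BiLSTM produces per-token emission scores, a learned transition matrix supplies the sentence-level tag dependencies, and Viterbi decoding picks the highest-scoring tag sequence.

```python
import torch
import torch.nn as nn

class BiLSTMTagger(nn.Module):
    def __init__(self, vocab_size, num_tags, embed_dim=100, hidden_dim=128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True,
                            bidirectional=True)
        self.emissions = nn.Linear(2 * hidden_dim, num_tags)
        # transitions[i, j] = score of moving from tag i to tag j
        self.transitions = nn.Parameter(torch.randn(num_tags, num_tags))

    def forward(self, token_ids):
        # token_ids: (batch, seq_len) -> emission scores (batch, seq_len, num_tags)
        out, _ = self.lstm(self.embed(token_ids))
        return self.emissions(out)

    def viterbi_decode(self, emissions):
        # emissions: (seq_len, num_tags) for a single sentence
        seq_len, num_tags = emissions.shape
        score = emissions[0]                 # best score ending in each tag so far
        backpointers = []
        for t in range(1, seq_len):
            # score of every (previous tag -> current tag) move
            total = score.unsqueeze(1) + self.transitions + emissions[t].unsqueeze(0)
            score, best_prev = total.max(dim=0)
            backpointers.append(best_prev)
        best_tag = int(score.argmax())
        path = [best_tag]
        for best_prev in reversed(backpointers):
            best_tag = int(best_prev[best_tag])
            path.append(best_tag)
        return list(reversed(path))


model = BiLSTMTagger(vocab_size=10000, num_tags=5)
scores = model(torch.randint(0, 10000, (1, 4)))   # one 4-token sentence
print(model.viterbi_decode(scores[0]))            # e.g. [1, 2, 0, 3] (tag indices)
```

Training the full CRF additionally requires the forward algorithm to compute the log-partition term of the loss; that part is omitted here for brevity.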
End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF
State-of-the-art sequence labeling systems traditionally require large amounts of task-specific knowledge in the form of hand-crafted features and data pre-processing.
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
Biomedical text mining is becoming increasingly important as the number of biomedical documents rapidly grows.
ERNIE: Enhanced Representation through Knowledge Integration
We present a novel language representation model enhanced by knowledge called ERNIE (Enhanced Representation through kNowledge IntEgration).
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
Recent progress in pre-trained neural language models has significantly improved the performance of many natural language processing (NLP) tasks.
Named Entity Recognition with Bidirectional LSTM-CNNs
Named entity recognition is a challenging task that has traditionally required large amounts of knowledge in the form of feature engineering and lexicons to achieve high performance.
NEZHA: Neural Contextualized Representation for Chinese Language Understanding
Pre-trained language models have achieved great success in various natural language understanding (NLU) tasks thanks to their capacity to capture deep contextualized information in text by pre-training on large-scale corpora.