Named Entity Recognition (NER)

886 papers with code • 76 benchmarks • 122 datasets

Named Entity Recognition (NER) is a task of Natural Language Processing (NLP) that involves identifying and classifying named entities in a text into predefined categories such as person names, organizations, locations, and others. The goal of NER is to extract structured information from unstructured text data and represent it in a machine-readable format. Approaches typically use BIO notation, which differentiates the beginning (B) and the inside (I) of entities. O is used for non-entity tokens.

Example:

Mark	Watney	visited	Mars
B-PER	I-PER	O	B-LOC

( Image credit: Zalando )

Benchmarks

Add a Result

These leaderboards are used to track progress in Named Entity Recognition (NER)

Dataset	Best Model	Compare
CoNLL 2003 (English)	ACE + document-context	See all
Ontonotes v5 (English)	BERT-MRC+DSC	See all
NCBI-disease	Spark NLP	See all
WNUT 2017	CL-KL	See all
ACE 2005	Ours: cross-sentence ALB	See all
BC5CDR	BINDER	See all
JNLPBA	KeBioLM	See all
GENIA	DeepStruct multi-task w/ finetune	See all
BC5CDR-chemical	Spark NLP	See all
SLUE	W2V2-L-LL60K (pipeline approach, uses LM)	See all
BC2GM	Spark NLP	See all
BC5CDR-disease	BioMegatron	See all
ACE 2004	Ours: cross-sentence ALB	See all
CoNLL++	Noise-robust Co-regularization + LUKE	See all
SciERC	SciDeBERTa v2	See all
WNUT 2016	HGN	See all
CoNLL 2003 (German)	ACE + document-context	See all
CoNLL 2002 (Spanish)	ACE + document-context	See all
CoNLL 2002 (Dutch)	ACE + document-context	See all
CoNLL03	UniNER-7B	See all
CoNLL 2003 (German) Revised	FLERT XLM-R	See all
Few-NERD (SUP)	PL-Marker	See all
Species-800	BioKMNER + BioBERT	See all
BC4CHEMD	UniNER-7B	See all
AnatEM	ConNER	See all
CORD-r	TPP (LayoutLMv3)	See all
FUNSD-r	TPP (LayoutLMv3)	See all
LINNAEUS	BLSTM-CNN-Char (SparkNLP)	See all
NEMO-Corpus (morph,test)	AlephBERT-base Pipeline	See all
WNUT 2020	mgsohrab	See all
DWIE	REXEL	See all
FindVehicle	BiLSTM-CRF	See all
BioRED	PubMedBERT-CRF	See all
OntoNotes	DeepStruct multi-task w/ finetune	See all
SemClinBr	pucpr/biobertpt-clin	See all
WLPC	DyGIE	See all
Species800	BLSTM-CNN-Char (SparkNLP)	See all
BioNLP13-CG	BLSTM-CNN-Char (SparkNLP)	See all
NEMO-Corpus (token,test)	AlephBERT-base	See all
CMeEE	BERT-CRF (Replicated in AdaSeq)	See all
HiNER-original	cfilt/HiNER-original-xlm-roberta-large	See all
HiNER-collapsed	cfilt/HiNER-collapsed-xlm-roberta-large	See all
BC7 NLM-Chem	PubMedBERT+MLP+CRF	See all
MasakhaNER	BERT	See all
OntoNotes 5.0	HGN	See all
ACE2005	DeepStruct multi-task w/ finetune	See all
WetLab	BiLSTM-CRF with ELMo	See all
Code-Switching English-Spanish NER	HME (word + BPE + char)	See all
CoNLL 2000	SWEM-CRF	See all
French Treebank	CamemBERT (subword masking)	See all
SoSciSoCi	Bi-LSTM-CRF (SSC->GSC)	See all
LeNER-Br	LSTM-CRF	See all
NCBI Disease	UniNER-7B	See all
DaNE	saattrupdan/nbailab-base-ner-scandi	See all
IECSIL FIRE-2018 Shared Task	XLM-RoBERTa	See all
LegalNERo	Marcell	See all
Adverse Drug Events (ADE) Corpus	Spark NLP	See all
i2b2 De-identification Dataset	BiLSTM with ELMo	See all
Broad Twitter Corpus	WORD_GAZ	See all
Gellus	ConNER	See all
SemEval 2022 - BanglaCoNER	POS Tagger, Prefix, Suffix, k-Neighbor Words, k-means clustering	See all
SemEval 2022-2023 - BanglaCoNER	FT-Bangla BERT Large	See all
NEMO-Corpus	AlephBERTGimmel-base MTL	See all
UNER v1 (Danish)	UNER XML-R	See all
UNER v1 (English)	UNER XML-R	See all
UNER v1 (Croatian)	UNER XML-R	See all
UNER v1 (Portuguese)	UNER XML-R	See all
UNER v1 (Slovak)	UNER XML-R	See all
UNER v1 (Serbian)	UNER XML-R	See all
UNER v1 (Swedish)	UNER XML-R	See all
UNER v1 (Chinese)	UNER XML-R	See all
UNER v1 (Chinese Simplified)	UNER XML-R	See all
UNER v1 - PUD (English)	UNER XML-R	See all
UNER v1 - PUD (Portuguese)	UNER XML-R	See all
UNER v1 - PUD (Swedish)	UNER XML-R	See all
UNER v1 - PUD (Chinese)	UNER XML-R	See all

Show all 76 benchmarks

Collapse benchmarks

Libraries

Use these libraries to find Named Entity Recognition (NER) models and implementations

flairNLP/flair

6 papers

13,565

huggingface/transformers

5 papers

124,984

l3cube-pune/MarathiNLP

4 papers

dmlc/gluon-nlp

3 papers

2,548

See all 7 libraries.

Datasets

Subtasks

Few-shot NER

Medical Named Entity Recognition

Multilingual Named Entity Recognition

Cross-Domain Named Entity Recognition

Named Entity Recognition In Vietnamese

Multi-modal Named Entity Recognition

Zero-shot Named Entity Recognition (NER)

Toponym Recognition

Scientific Concept Extraction

Multi-Grained Named Entity Recognition

Latest papers with no code

Most implemented Social Latest No code

Do "English" Named Entity Recognizers Work Well on Global Englishes?

no code yet • 20 Apr 2024

We test widely used NER toolkits and transformer models, including models using the pre-trained contextual models RoBERTa and ELECTRA, on three datasets: a commonly used British English newswire dataset, CoNLL 2003, a more American focused dataset OntoNotes, and our global dataset.

Paper
Add Code

Few-shot Name Entity Recognition on StackOverflow

no code yet • 15 Apr 2024

StackOverflow, with its vast question repository and limited labeled examples, raise an annotation challenge for us.

Paper
Add Code

ToNER: Type-oriented Named Entity Recognition with Generative Language Model

no code yet • 14 Apr 2024

In recent years, the fine-tuned generative models have been proven more powerful than the previous tagging-based or span-based models on named entity recognition (NER) task.

Paper
Add Code

LLMs in Biomedicine: A study on clinical Named Entity Recognition

no code yet • 10 Apr 2024

Large Language Models (LLMs) demonstrate remarkable versatility in various NLP tasks but encounter distinct challenges in biomedicine due to medical language complexities and data scarcity.

Paper
Add Code

Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding

no code yet • 8 Apr 2024

Recent advances in natural language processing (NLP) can be largely attributed to the advent of pre-trained language models such as BERT and RoBERTa.

Paper
Add Code

Enhancing Software-Related Information Extraction via Single-Choice Question Answering with Large Language Models

no code yet • 8 Apr 2024

This paper describes our participation in the Shared Task on Software Mentions Disambiguation (SOMD), with a focus on improving relation extraction in scholarly texts through generative Large Language Models (LLMs) using single-choice question-answering.

Paper
Add Code

How much reliable is ChatGPT's prediction on Information Extraction under Input Perturbations?

no code yet • 7 Apr 2024

In this paper, we assess the robustness (reliability) of ChatGPT under input perturbations for one of the most fundamental tasks of Information Extraction (IE) i. e. Named Entity Recognition (NER).

Paper
Add Code

SCANNER: Knowledge-Enhanced Approach for Robust Multi-modal Named Entity Recognition of Unseen Entities

no code yet • 2 Apr 2024

Our approach demonstrates competitive performance on the NER benchmark and surpasses existing methods on both MNER and GMNER benchmarks.

Paper
Add Code

Utilizing AI and Social Media Analytics to Discover Adverse Side Effects of GLP-1 Receptor Agonists

no code yet • 1 Apr 2024

Adverse side effects (ASEs) of drugs, revealed after FDA approval, pose a threat to patient safety.

Paper
Add Code

Augmenting NER Datasets with LLMs: Towards Automated and Refined Annotation

no code yet • 30 Mar 2024

In the field of Natural Language Processing (NLP), Named Entity Recognition (NER) is recognized as a critical technology, employed across a wide array of applications.

Paper
Add Code

Named Entity Recognition (NER)

Benchmarks Add a Result

Libraries

Datasets

Subtasks

Latest papers with no code

Content

Benchmarks

Add a Result