TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Named Entity Recognition (NER)	Ontonotes v5 (English)	Att-BiLSTM-CNN	F1	88.4	# 19
Named Entity Recognition (NER)	Ontonotes v5 (English)	Att-BiLSTM-CNN	Precision	88.71	# 2
Named Entity Recognition (NER)	Ontonotes v5 (English)	Att-BiLSTM-CNN	Recall	88.11	# 2
Named Entity Recognition (NER)	WNUT 2017	Cross-BiLSTM-CNN	F1	42.85	# 19
Named Entity Recognition (NER)	WNUT 2017	Cross-BiLSTM-CNN	Precision	58.28	# 1
Named Entity Recognition (NER)	WNUT 2017	Cross-BiLSTM-CNN	Recall	33.92	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/remedying-bilstm-cnn-deficiency-in-modeling/named-entity-recognition-ner-on-ontonotes-v5)](https://paperswithcode.com/sota/named-entity-recognition-ner-on-ontonotes-v5?p=remedying-bilstm-cnn-deficiency-in-modeling)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/remedying-bilstm-cnn-deficiency-in-modeling/named-entity-recognition-on-wnut-2017)](https://paperswithcode.com/sota/named-entity-recognition-on-wnut-2017?p=remedying-bilstm-cnn-deficiency-in-modeling)`

Why Attention? Analyze BiLSTM Deficiency and Its Remedies in the Case of NER

29 Aug 2019 · Peng-Hsuan Li, Tsu-Jui Fu, Wei-Yun Ma ·

BiLSTM has been prevalently used as a core module for NER in a sequence-labeling setup. State-of-the-art approaches use BiLSTM with additional resources such as gazetteers, language-modeling, or multi-task supervision to further improve NER. This paper instead takes a step back and focuses on analyzing problems of BiLSTM itself and how exactly self-attention can bring improvements. We formally show the limitation of (CRF-)BiLSTM in modeling cross-context patterns for each word -- the XOR limitation. Then, we show that two types of simple cross-structures -- self-attention and Cross-BiLSTM -- can effectively remedy the problem. We test the practical impacts of the deficiency on real-world NER datasets, OntoNotes 5.0 and WNUT 2017, with clear and consistent improvements over the baseline, up to 8.7% on some of the multi-token entity mentions. We give in-depth analyses of the improvements across several aspects of NER, especially the identification of multi-token mentions. This study should lay a sound foundation for future improvements on sequence-labeling NER. (Source codes: https://github.com/jacobvsdanniel/cross-ner)

PDF Abstract

Code

Add Remove Mark official

jacobvsdanniel/cross-ner official

jacobvsdanniel/cross_ner official

ckiplab/ckiptagger

↳ Quickstart in

Colab

1,616

chainwu/ckiptagger-app

Tasks

Add Remove

Named Entity Recognition (NER)

NER

Datasets

OntoNotes 5.0 WNUT 2017

Results from the Paper

Edit

Ranked #19 on Named Entity Recognition (NER) on WNUT 2017

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Named Entity Recognition (NER)	Ontonotes v5 (English)	Att-BiLSTM-CNN	F1	88.4	# 19	Compare
			Precision	88.71	# 2	Compare
			Recall	88.11	# 2	Compare
Named Entity Recognition (NER)	WNUT 2017	Cross-BiLSTM-CNN	F1	42.85	# 19	Compare
			Precision	58.28	# 1	Compare
			Recall	33.92	# 1	Compare

Methods

Add Remove

BiLSTM • LSTM • Sigmoid Activation • Tanh Activation

Edit Social Preview

Why Attention? Analyze BiLSTM Deficiency and Its Remedies in the Case of NER

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove