TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
FG-1-PG-1	2010 i2b2/VA	CFNER	F1 (micro)	0.6273	# 1
FG-1-PG-1	2010 i2b2/VA	CFNER	F1 (macro)	0.3626	# 1
FG-1-PG-1	conll2003	CFNER	F1 (micro)	0.8091	# 1
FG-1-PG-1	conll2003	CFNER	F1 (macro)	0.7911	# 1
FG-1-PG-1	OntoNotes 5.0	CFNER	F1 (micro)	0.5894	# 1
FG-1-PG-1	OntoNotes 5.0	CFNER	F1 (macro)	0.4222	# 1

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/distilling-causal-effect-from-miscellaneous/fg-1-pg-1-on-2010-i2b2-va)](https://paperswithcode.com/sota/fg-1-pg-1-on-2010-i2b2-va?p=distilling-causal-effect-from-miscellaneous)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/distilling-causal-effect-from-miscellaneous/fg-1-pg-1-on-conll2003)](https://paperswithcode.com/sota/fg-1-pg-1-on-conll2003?p=distilling-causal-effect-from-miscellaneous)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/distilling-causal-effect-from-miscellaneous/fg-1-pg-1-on-ontonotes-5-0)](https://paperswithcode.com/sota/fg-1-pg-1-on-ontonotes-5-0?p=distilling-causal-effect-from-miscellaneous)`

Distilling Causal Effect from Miscellaneous Other-Class for Continual Named Entity Recognition

8 Oct 2022 · Junhao Zheng, Zhanxian Liang, Haibin Chen, Qianli Ma ·

Continual Learning for Named Entity Recognition (CL-NER) aims to learn a growing number of entity types over time from a stream of data. However, simply learning Other-Class in the same way as new entity types amplifies the catastrophic forgetting and leads to a substantial performance drop. The main cause behind this is that Other-Class samples usually contain old entity types, and the old knowledge in these Other-Class samples is not preserved properly. Thanks to the causal inference, we identify that the forgetting is caused by the missing causal effect from the old data. To this end, we propose a unified causal framework to retrieve the causality from both new entity types and Other-Class. Furthermore, we apply curriculum learning to mitigate the impact of label noise and introduce a self-adaptive weight for balancing the causal effects between new entity types and Other-Class. Experimental results on three benchmark datasets show that our method outperforms the state-of-the-art method by a large margin. Moreover, our method can be combined with the existing state-of-the-art methods to improve the performance in CL-NER

PDF Abstract

Code

Add Remove Mark official

zzz47zzz/CFNER official

Tasks

Add Remove

Causal Inference

Continual Learning

Continual Named Entity Recognition

FG-1-PG-1

Miscellaneous

named-entity-recognition

Named Entity Recognition

Named Entity Recognition (NER)

NER

Datasets

CoNLL 2003 OntoNotes 5.0 CoNLL 2010 i2b2/VA

Results from the Paper

Edit

Ranked #1 on FG-1-PG-1 on conll2003

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
FG-1-PG-1	2010 i2b2/VA	CFNER	F1 (micro)	0.6273	# 1	Compare
FG-1-PG-1	2010 i2b2/VA	CFNER	F1 (macro)	0.3626	# 1	Compare
FG-1-PG-1	conll2003	CFNER	F1 (micro)	0.8091	# 1	Compare
FG-1-PG-1	conll2003	CFNER	F1 (macro)	0.7911	# 1	Compare
FG-1-PG-1	OntoNotes 5.0	CFNER	F1 (micro)	0.5894	# 1	Compare
FG-1-PG-1	OntoNotes 5.0	CFNER	F1 (macro)	0.4222	# 1	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Distilling Causal Effect from Miscellaneous Other-Class for Continual Named Entity Recognition

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove