Remedying BiLSTM-CNN Deficiency in Modeling Cross-Context for NER

29 Aug 2019Peng-Hsuan LiTsu-Jui FuWei-Yun Ma

Recent researches prevalently used BiLSTM-CNN as a core module for NER in a sequence-labeling setup. This paper formally shows the limitation of BiLSTM-CNN encoders in modeling cross-context patterns for each word, i.e., patterns crossing past and future for a specific time step... (read more)

PDF Abstract

Evaluation results from the paper

Task Dataset Model Metric name Metric value Global rank Compare
Named Entity Recognition Long-tail emerging entities Cross-BiLSTM-CNN F1 42.85 # 3
Named Entity Recognition Ontonotes v5 (English) Att-BiLSTM-CNN F1 88.4 # 4