Using Similarity Measures to Select Pretraining Data for NER

NAACL 2019 · Xiang Dai, Sarvnaz Karimi, Ben Hachey, Cecile Paris

Word vectors and Language Models (LMs) pretrained on a large amount of unlabelled data can dramatically improve various Natural Language Processing (NLP) tasks. However, the measure and impact of similarity between pretraining data and target task data are left to intuition...

Evaluation results from the paper

Task                      Dataset  Model                 Metric name  Metric value  Global rank
Named Entity Recognition  JNLPBA   BiLSTM-CRF with ELMo  F1           74.29         #5
Named Entity Recognition  WetLab   BiLSTM-CRF with ELMo  F1           79.62         #1