NegBERT: A Transfer Learning Approach for Negation Detection and Scope Resolution

LREC 2020  ·  Aditya Khandelwal, Suraj Sawant ·

Negation is an important characteristic of language, and a major component of information extraction from text. This subtask is of considerable importance to the biomedical domain. Over the years, multiple approaches have been explored to address this problem: Rule-based systems, Machine Learning classifiers, Conditional Random Field Models, CNNs and more recently BiLSTMs. In this paper, we look at applying Transfer Learning to this problem. First, we extensively review previous literature addressing Negation Detection and Scope Resolution across the 3 datasets that have gained popularity over the years: the BioScope Corpus, the Sherlock dataset, and the SFU Review Corpus. We then explore the decision choices involved with using BERT, a popular transfer learning model, for this task, and report state-of-the-art results for scope resolution across all 3 datasets. Our model, referred to as NegBERT, achieves a token level F1 score on scope resolution of 92.36 on the Sherlock dataset, 95.68 on the BioScope Abstracts subcorpus, 91.24 on the BioScope Full Papers subcorpus, 90.95 on the SFU Review Corpus, outperforming the previous state-of-the-art systems by a significant margin. We also analyze the model's generalizability to datasets on which it is not trained.

PDF Abstract LREC 2020 PDF LREC 2020 Abstract

Datasets


Results from the Paper


Ranked #2 on Negation Scope Resolution on SFU Review Corpus (using extra training data)

     Get a GitHub badge
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Benchmark
Negation Scope Resolution BioScope : Abstracts NegBERT F1 95.68 # 3
Negation Scope Resolution BioScope : Full Papers NegBERT F1 91.24 # 2
Negation and Speculation Cue Detection *sem 2012 Shared Task: Sherlock Dataset NegBERT F1 92.94 # 2
Negation Scope Resolution *sem 2012 Shared Task: Sherlock Dataset NegBERT F1 92.36 # 2
Negation Scope Resolution SFU Review Corpus NegBERT F1 90.95 # 2

Methods