TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Named Entity Recognition (NER)	BC5CDR	ELECTRAMed	F1	90.03	# 7
Relation Extraction	ChemProt	ELECTRAMed	F1	72.94	# 11
Drug–drug Interaction Extraction	DDI extraction 2013 corpus	ELECTRAMed	Micro F1	79.13	# 6
Named Entity Recognition (NER)	NCBI-disease	ELECTRAMed	F1	87.54	# 20

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/electramed-a-new-pre-trained-language/drug-drug-interaction-extraction-on-ddi)](https://paperswithcode.com/sota/drug-drug-interaction-extraction-on-ddi?p=electramed-a-new-pre-trained-language)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/electramed-a-new-pre-trained-language/named-entity-recognition-ner-on-bc5cdr)](https://paperswithcode.com/sota/named-entity-recognition-ner-on-bc5cdr?p=electramed-a-new-pre-trained-language)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/electramed-a-new-pre-trained-language/relation-extraction-on-chemprot)](https://paperswithcode.com/sota/relation-extraction-on-chemprot?p=electramed-a-new-pre-trained-language)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/electramed-a-new-pre-trained-language/named-entity-recognition-ner-on-ncbi-disease)](https://paperswithcode.com/sota/named-entity-recognition-ner-on-ncbi-disease?p=electramed-a-new-pre-trained-language)`

ELECTRAMed: a new pre-trained language representation model for biomedical NLP

19 Apr 2021 · Giacomo Miolo, Giulio Mantoan, Carlotta Orsenigo ·

The overwhelming amount of biomedical scientific texts calls for the development of effective language models able to tackle a wide range of biomedical natural language processing (NLP) tasks. The most recent dominant approaches are domain-specific models, initialized with general-domain textual data and then trained on a variety of scientific corpora. However, it has been observed that for specialized domains in which large corpora exist, training a model from scratch with just in-domain knowledge may yield better results. Moreover, the increasing focus on the compute costs for pre-training recently led to the design of more efficient architectures, such as ELECTRA. In this paper, we propose a pre-trained domain-specific language model, called ELECTRAMed, suited for the biomedical field. The novel approach inherits the learning framework of the general-domain ELECTRA architecture, as well as its computational advantages. Experiments performed on benchmark datasets for several biomedical NLP tasks support the usefulness of ELECTRAMed, which sets the novel state-of-the-art result on the BC5CDR corpus for named entity recognition, and provides the best outcome in 2 over the 5 runs of the 7th BioASQ-factoid Challange for the question answering task.

PDF Abstract

Code

Add Remove Mark official

gmpoli/electramed official

Tasks

Add Remove

Drug–drug Interaction Extraction

Language Modelling

Medical Named Entity Recognition

named-entity-recognition

Named Entity Recognition

Named Entity Recognition (NER)

Question Answering

Relation Extraction

Datasets

BC5CDR

BioASQ NCBI Disease BLUE

DDI ChemProt

Results from the Paper

Add Remove

Ranked #6 on Drug–drug Interaction Extraction on DDI extraction 2013 corpus (Micro F1 metric, using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Named Entity Recognition (NER)	BC5CDR	ELECTRAMed	F1	90.03	# 7	Compare
Relation Extraction	ChemProt	ELECTRAMed	F1	72.94	# 11	Compare
Drug–drug Interaction Extraction	DDI extraction 2013 corpus	ELECTRAMed	Micro F1	79.13	# 6	Compare
Named Entity Recognition (NER)	NCBI-disease	ELECTRAMed	F1	87.54	# 20	Compare

Methods

Add Remove

Adam • Attention Dropout • Dense Connections • Dropout • ELECTRA • GELU • Layer Normalization • Linear Layer • Linear Warmup With Linear Decay • Multi-Head Attention • Residual Connection • Scaled Dot-Product Attention • Softmax • Weight Decay • WordPiece

Edit Social Preview

ELECTRAMed: a new pre-trained language representation model for biomedical NLP

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove