TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Relation Extraction	Adverse Drug Events (ADE) Corpus	PFN (ALBERT XXL, average aggregation)	RE+ Macro F1	83.9	# 2
Relation Extraction	Adverse Drug Events (ADE) Corpus	PFN (ALBERT XXL, average aggregation)	NER Macro F1	91.5	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/an-information-extraction-study-take-in-mind/relation-extraction-on-ade-corpus)](https://paperswithcode.com/sota/relation-extraction-on-ade-corpus?p=an-information-extraction-study-take-in-mind)`

An Information Extraction Study: Take In Mind the Tokenization!

27 Mar 2023 · Christos Theodoropoulos, Marie-Francine Moens ·

Current research on the advantages and trade-offs of using characters, instead of tokenized text, as input for deep learning models, has evolved substantially. New token-free models remove the traditional tokenization step; however, their efficiency remains unclear. Moreover, the effect of tokenization is relatively unexplored in sequence tagging tasks. To this end, we investigate the impact of tokenization when extracting information from documents and present a comparative study and analysis of subword-based and character-based models. Specifically, we study Information Extraction (IE) from biomedical texts. The main outcome is twofold: tokenization patterns can introduce inductive bias that results in state-of-the-art performance, and the character-based models produce promising results; thus, transitioning to token-free IE models is feasible.

PDF Abstract

Code

Add Remove Mark official

christos42/inductive_bias_IE official

Tasks

Add Remove

Inductive Bias

Named Entity Recognition (NER)

Relation Extraction

Datasets

Adverse Drug Events (ADE) Corpus

Results from the Paper

Edit

Ranked #2 on Relation Extraction on Adverse Drug Events (ADE) Corpus

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Result	Benchmark
Relation Extraction	Adverse Drug Events (ADE) Corpus	PFN (ALBERT XXL, average aggregation)	RE+ Macro F1	83.9	# 2		Compare
Relation Extraction	Adverse Drug Events (ADE) Corpus	PFN (ALBERT XXL, average aggregation)	NER Macro F1	91.5	# 2		Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

An Information Extraction Study: Take In Mind the Tokenization!

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove