FHAC at GermEval 2021: Identifying German toxic, engaging, and fact-claiming comments with ensemble learning

The availability of language representations learned by large pretrained neural network models (such as BERT and ELECTRA) has led to improvements in many downstream Natural Language Processing tasks in recent years. Pretrained models usually differ in their pretraining objectives, architectures, and the datasets they are trained on, all of which can affect downstream performance. In this contribution, we fine-tuned German BERT and German ELECTRA models to identify toxic (subtask 1), engaging (subtask 2), and fact-claiming comments (subtask 3) in Facebook data provided by the GermEval 2021 competition. We created ensembles of these models and investigated whether and how classification performance depends on the number of ensemble members and their composition. On out-of-sample data, our best ensemble achieved a macro-F1 score of 0.73 (across all subtasks), and F1 scores of 0.72, 0.70, and 0.76 for subtasks 1, 2, and 3, respectively.
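The abstract does not spell out the ensembling scheme, so the following is only a hedged sketch of one common approach, probability averaging (soft voting): each fine-tuned member model outputs a probability that a comment belongs to the positive class, the per-comment probabilities are averaged, and the average is thresholded. All names and numbers here are illustrative, not taken from the paper.

```python
# Hypothetical soft-voting ensemble sketch (not the paper's exact method):
# each member is assumed to emit a probability that a comment is, e.g., toxic.
from statistics import mean

def ensemble_predict(member_probs, threshold=0.5):
    """Average each comment's probabilities across members, then threshold.

    member_probs: list of per-member probability lists, one entry per comment.
    Returns a 0/1 label per comment.
    """
    labels = []
    for probs in zip(*member_probs):  # iterate over comments
        labels.append(1 if mean(probs) >= threshold else 0)
    return labels

# Three hypothetical ensemble members scoring the same two comments:
members = [
    [0.9, 0.2],  # member 1
    [0.7, 0.4],  # member 2
    [0.6, 0.1],  # member 3
]
print(ensemble_predict(members))  # → [1, 0]
```

Varying how many member lists are passed in mirrors the paper's question of how performance depends on ensemble size and composition.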


Datasets


| Task | Dataset | Model | Metric | Value | Global Rank |
|---|---|---|---|---|---|
| Classification of toxic, engaging, fact-claiming comments | GermEval | GBERT/GELECTRA Ensemble | Macro-F1 | 72.7 | #1 |
| Engaging Comment Classification | GermEval 2021 - Engaging Comments test set | GBERT/GELECTRA Ensemble | F1 | 69.9 | #1 |
| Fact-Claiming Comment Classification | GermEval 2021 - Fact-Claiming Comments test set | GBERT/GELECTRA Ensemble | F1 | 76.8 | #1 |
| Toxic Comment Classification | GermEval 2021 - Toxic Comments test set | GBERT/GELECTRA Ensemble | F1 | 71.8 | #1 |
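For readers unfamiliar with the reported metric: macro-F1 is the unweighted mean of per-class F1 scores, where each F1 is the harmonic mean of precision and recall. The sketch below shows the standard definition with made-up confusion counts; the exact GermEval 2021 evaluation script may aggregate differently (e.g. per-instance over all three labels), so these numbers are purely illustrative.

```python
# Standard macro-F1 definition with hypothetical confusion counts
# (tp = true positives, fp = false positives, fn = false negatives).

def f1(tp, fp, fn):
    """F1 = harmonic mean of precision and recall."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def macro_f1(per_class_counts):
    """Unweighted mean of per-class F1 scores."""
    return sum(f1(*c) for c in per_class_counts) / len(per_class_counts)

# Illustrative counts for three binary labels:
counts = [(8, 2, 2), (6, 4, 2), (9, 1, 3)]
print(round(macro_f1(counts), 3))  # → 0.762
```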
