TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Multi-Label Text Classification	Reuters-21578	CB-NTR	Micro-F1	90.74	# 1
Multi-Label Text Classification	Reuters-21578	NTR-FL	Micro-F1	90.70	# 2
Multi-Label Text Classification	Reuters-21578	DB	Micro-F1	90.62	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/balancing-methods-for-multi-label-text/multi-label-text-classification-on-reuters-1)](https://paperswithcode.com/sota/multi-label-text-classification-on-reuters-1?p=balancing-methods-for-multi-label-text)`

Balancing Methods for Multi-label Text Classification with Long-Tailed Class Distribution

EMNLP 2021 · Yi Huang, Buse Giledereli, Abdullatif Köksal, Arzucan Özgür, Elif Ozkirimli ·

Multi-label text classification is a challenging task because it requires capturing label dependencies. It becomes even more challenging when class distribution is long-tailed. Resampling and re-weighting are common approaches used for addressing the class imbalance problem, however, they are not effective when there is label dependency besides class imbalance because they result in oversampling of common labels. Here, we introduce the application of balancing loss functions for multi-label text classification. We perform experiments on a general domain dataset with 90 labels (Reuters-21578) and a domain-specific dataset from PubMed with 18211 labels. We find that a distribution-balanced loss function, which inherently addresses both the class imbalance and label linkage problems, outperforms commonly used loss functions. Distribution balancing methods have been successfully used in the image recognition field. Here, we show their effectiveness in natural language processing. Source code is available at https://github.com/Roche/BalancedLossNLP.

PDF Abstract EMNLP 2021 PDF EMNLP 2021 Abstract

Code

Add Remove Mark official

Roche/BalancedLossNLP official

114

blessu/balancedlossnlp official

Tasks

Add Remove

Document Classification

Multi-Label Text Classification

Text Classification

Datasets

Reuters-21578

Results from the Paper

Edit

Ranked #1 on Multi-Label Text Classification on Reuters-21578

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Multi-Label Text Classification	Reuters-21578	CB-NTR	Micro-F1	90.74	# 1	Compare
Multi-Label Text Classification	Reuters-21578	NTR-FL	Micro-F1	90.70	# 2	Compare
Multi-Label Text Classification	Reuters-21578	DB	Micro-F1	90.62	# 3	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Balancing Methods for Multi-label Text Classification with Long-Tailed Class Distribution

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove