TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Out-of-Distribution Generalization	UrbanCars	SUBG (CoObj)	BG Gap	-60.2	# 1
Out-of-Distribution Generalization	UrbanCars	SUBG (CoObj)	CoObj Gap	+2.5	# 1
Out-of-Distribution Generalization	UrbanCars	SUBG (CoObj)	BG+CoObj Gap	-62.4	# 1
Out-of-Distribution Generalization	UrbanCars	SUBG (BG)	BG Gap	+1.3	# 1
Out-of-Distribution Generalization	UrbanCars	SUBG (BG)	CoObj Gap	-36.4	# 1
Out-of-Distribution Generalization	UrbanCars	SUBG (BG)	BG+CoObj Gap	-35.8	# 1
Out-of-Distribution Generalization	UrbanCars	SUBG (BG+CoObj)	BG Gap	-4.7	# 1
Out-of-Distribution Generalization	UrbanCars	SUBG (BG+CoObj)	CoObj Gap	-0.3	# 1
Out-of-Distribution Generalization	UrbanCars	SUBG (BG+CoObj)	BG+CoObj Gap	-6.3	# 1

Simple data balancing achieves competitive worst-group-accuracy

27 Oct 2021 · Badr Youbi Idrissi, Martin Arjovsky, Mohammad Pezeshki, David Lopez-Paz

We study the problem of learning classifiers that perform well across (known or unknown) groups of data. After observing that common worst-group-accuracy datasets suffer from substantial imbalances, we set out to compare state-of-the-art methods to simple balancing of classes and groups by either subsampling or reweighting data. Our results show that these data balancing baselines achieve state-of-the-art accuracy, while being faster to train and requiring no additional hyper-parameters. In addition, we highlight that access to group information is most critical for model selection purposes, and not so much during training. All in all, our findings beg closer examination of benchmarks and methods for research in worst-group-accuracy optimization.
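The two balancing strategies named in the abstract (subsampling and reweighting by group) and the worst-group-accuracy metric can be illustrated with a minimal sketch. This is not the code from the official repository: the function names `subsample_groups`, `group_weights`, and `worst_group_accuracy` are hypothetical, and the sketch assumes a group is a (class label, attribute) pair and that subsampling shrinks every group to the size of the smallest one.

```python
import random
from collections import Counter, defaultdict


def subsample_groups(groups, seed=0):
    """Keep an equal number of examples per group (the size of the smallest group).

    Assumption: `groups` holds one hashable group id per example.
    Returns the sorted indices of the examples that are kept.
    """
    by_group = defaultdict(list)
    for idx, g in enumerate(groups):
        by_group[g].append(idx)
    n_min = min(len(ids) for ids in by_group.values())
    rng = random.Random(seed)
    kept = []
    for ids in by_group.values():
        # Draw n_min examples without replacement from each group.
        kept.extend(rng.sample(ids, n_min))
    return sorted(kept)


def group_weights(groups):
    """Per-example weights inversely proportional to group size.

    The weights inside each group sum to 1, so every group contributes
    equally to a weighted loss or a weighted sampler.
    """
    counts = Counter(groups)
    return [1.0 / counts[g] for g in groups]


def worst_group_accuracy(y_true, y_pred, groups):
    """Accuracy of the group on which the predictions are least accurate."""
    correct, total = Counter(), Counter()
    for t, p, g in zip(y_true, y_pred, groups):
        total[g] += 1
        correct[g] += int(t == p)
    return min(correct[g] / total[g] for g in total)


# Toy data (hypothetical): group = (class label, spurious attribute).
groups = [(0, 0)] * 6 + [(0, 1)] * 2 + [(1, 0)] * 3 + [(1, 1)] * 5
kept = subsample_groups(groups)
print(Counter(groups[i] for i in kept))  # per-group counts after subsampling
print(sum(group_weights(groups)))        # total weight across the four groups
```

Subsampling discards data but needs no change to the training loop, whereas the weights can be passed to a weighted sampler or loss so that all examples remain available.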

Code

facebookresearch/balancinggroups (official)

Tasks

Model Selection

Out-of-Distribution Generalization

Datasets

CelebA

MultiNLI

Civil Comments

UrbanCars

Results from the Paper

Ranked #1 on Out-of-Distribution Generalization on UrbanCars
