DABS: A Domain-Agnostic Benchmark for Self-Supervised Learning

23 Nov 2021  ·  Alex Tamkin, Vincent Liu, Rongfei Lu, Daniel Fein, Colin Schultz, Noah Goodman ·

Self-supervised learning algorithms, including BERT and SimCLR, have enabled significant strides in fields like natural language processing, computer vision, and speech processing. However, these algorithms are domain-specific, meaning that new self-supervised learning algorithms must be developed for each new setting, including myriad healthcare, scientific, and multimodal domains. To catalyze progress toward domain-agnostic methods, we introduce DABS: a Domain-Agnostic Benchmark for Self-supervised learning. To perform well on DABS, an algorithm is evaluated on seven diverse domains: natural images, multichannel sensor data, English text, speech recordings, multilingual text, chest x-rays, and images with text descriptions. Each domain contains an unlabeled dataset for pretraining; the model is then scored on its downstream performance across a set of labeled tasks in the same domain. We also present e-Mix and ShED, two baseline domain-agnostic algorithms; their relatively modest performance demonstrates that significant progress is needed before self-supervised learning is an out-of-the-box solution for arbitrary domains. Code for benchmark datasets and baseline algorithms is available at https://github.com/alextamkin/dabs.
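The evaluation protocol described above (pretrain on an unlabeled pool, freeze the encoder, then score transfer on labeled tasks) can be sketched in miniature. Everything below is illustrative, not the benchmark's actual API: the "pretraining" step is a placeholder SVD projection standing in for a self-supervised objective, and the downstream score is a closed-form linear probe on frozen features.

```python
import numpy as np

rng = np.random.default_rng(0)

def pretrain_encoder(unlabeled, dim=16):
    """Stand-in for self-supervised pretraining: fit a fixed linear
    projection on the unlabeled pool (a placeholder objective, not
    e-Mix or ShED), then return it as a frozen encoder."""
    X = unlabeled - unlabeled.mean(axis=0)
    _, _, vt = np.linalg.svd(X, full_matrices=False)
    W = vt[:dim].T           # "pretrained" weights, frozen after this step
    return lambda x: x @ W

def linear_probe_accuracy(encoder, X_train, y_train, X_test, y_test):
    """Fit a least-squares linear probe on frozen features and report
    downstream accuracy, mirroring the pretrain-then-transfer protocol."""
    Z_train, Z_test = encoder(X_train), encoder(X_test)
    Y = np.eye(y_train.max() + 1)[y_train]           # one-hot targets
    W, *_ = np.linalg.lstsq(Z_train, Y, rcond=None)  # closed-form probe
    preds = (Z_test @ W).argmax(axis=1)
    return (preds == y_test).mean()

# Toy data: two Gaussian blobs standing in for one labeled task
# in a single domain, plus a separate unlabeled pretraining pool.
unlabeled = rng.normal(size=(500, 32))
centers = rng.normal(size=(2, 32)) * 3
y = rng.integers(0, 2, size=200)
X = centers[y] + rng.normal(size=(200, 32))

enc = pretrain_encoder(unlabeled)
acc = linear_probe_accuracy(enc, X[:150], y[:150], X[150:], y[150:])
```

A real DABS entry would replace the SVD with an actual pretraining objective and run this loop over all seven domains, averaging the per-task transfer scores within each.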


Results from the Paper


Task: Self-Supervised Learning · Dataset: DABS · Metric value, with global rank in parentheses.

| Model              | Natural Images | Text      | Speech    | Sensors   | Med. Imaging | Images & Text |
|--------------------|----------------|-----------|-----------|-----------|--------------|---------------|
| Pretraining: None  | 10.1 (#3)      | 42.3 (#3) | 24.9 (#3) | 69.8 (#3) | 68.1 (#3)    | 57.5 (#1)     |
| Pretraining: ShED  | 20.9 (#2)      | 48.4 (#1) | 36.5 (#2) | 88.7 (#1) | 74.5 (#1)    | 54.3 (#2)     |
| Pretraining: e-Mix | 27.9 (#1)      | 44.1 (#2) | 41.8 (#1) | 79.5 (#2) | 72.4 (#2)    | 48.9 (#3)     |
