Wilds

Introduced by Koh et al. in WILDS: A Benchmark of in-the-Wild Distribution Shifts

Builds on top of recent data collection efforts by domain experts in these applications and provides a unified collection of datasets with evaluation metrics and train/test splits that are representative of real-world distribution shifts.

The v2.0 update adds unlabeled data to 8 datasets. The labeled data and evaluation metrics are exactly the same, so all previous results are directly comparable.

Source: WILDS: A Benchmark of in-the-Wild Distribution Shifts

Homepage

Benchmarks

Add a new result Link an existing benchmark

No benchmarks yet. Start a new benchmark or link an existing one.

Papers

Paper	Code	Results	Date	Stars

Dataset Loaders

Add Remove

p-lambda/wilds

532

Tasks

Similar Datasets

Civil Comments

Colored MNIST

Wilds

Benchmarks

Add a new result Link an existing benchmark

Papers

Dataset Loaders

Add Remove

Tasks

Similar Datasets

Civil Comments

Colored MNIST

fMoW

Usage

License

Modalities

Languages

Wilds

Benchmarks Edit Add a new result Link an existing benchmark

Papers

Dataset Loaders Edit Add Remove

Tasks Edit

Similar Datasets

Civil Comments

Colored MNIST

fMoW

Usage

License Edit

Modalities Edit

Languages Edit

Benchmarks

Add a new result Link an existing benchmark

Dataset Loaders

Add Remove

Tasks

License

Modalities

Languages