Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data

28 Dec 2022 · Harsh Rangwani, Sumukh K Aithal, Mayank Mishra, R. Venkatesh Babu

Real-world datasets exhibit imbalances of varying types and degrees. Several techniques based on re-weighting and margin adjustment of the loss are often used to enhance the performance of neural networks, particularly on minority classes. In this work, we analyze the class-imbalanced learning problem by examining the loss landscape of neural networks trained with re-weighting and margin-based techniques. Specifically, we examine the spectral density of the Hessian of the class-wise loss, through which we observe that the network weights converge to a saddle point in the loss landscapes of minority classes. Following this observation, we also find that optimization methods designed to escape saddle points can be effectively used to improve generalization on minority classes. We further demonstrate, both theoretically and empirically, that Sharpness-Aware Minimization (SAM), a recent technique that encourages convergence to flat minima, can be effectively used to escape saddle points for minority classes. Using SAM results in a 6.2% increase in accuracy on the minority classes over the state-of-the-art Vector Scaling Loss, leading to an overall average increase of 4% across imbalanced datasets. The code is available at: https://github.com/val-iisc/Saddle-LongTail.

| Task | Dataset | Model | Metric | Value | Rank |
|---|---|---|---|---|---|
| Long-tail Learning | CIFAR-100-LT (ρ=100) | PaCo + SAM | Error Rate | 47.0 | #20 |
| Long-tail Learning | CIFAR-100-LT (ρ=100) | GLMC + SAM | Error Rate | 40.99 | #6 |
| Long-tail Learning | CIFAR-100-LT (ρ=200) | PaCo + SAM | Error Rate | 52.0 | #1 |
| Long-tail Learning | CIFAR-100-LT (ρ=50) | GLMC + SAM | Error Rate | 34.72 | #5 |
| Long-tail Learning | CIFAR-10-LT (ρ=10) | LDAM + DRW + SAM | Error Rate | 10.6 | #28 |
| Long-tail Learning | CIFAR-10-LT (ρ=100) | GLMC + SAM | Error Rate | 10.82 | #2 |
| Long-tail Learning | CIFAR-10-LT (ρ=200) | LDAM + DRW + SAM | Error Rate | 21.9 | #1 |
| Long-tail Learning | CIFAR-10-LT (ρ=50) | GLMC + SAM | Error Rate | 8.44 | #1 |
| Long-tail Learning | ImageNet-LT | LDAM + DRW + SAM | Top-1 Accuracy | 53.1 | #40 |
| Long-tail Learning | iNaturalist 2018 | LDAM + DRW + SAM | Top-1 Accuracy | 70.1 | #30 |

Results from Other Papers

| Task | Dataset | Model | Metric | Value | Rank |
|---|---|---|---|---|---|
| Long-tail Learning | CIFAR-100-LT (ρ=100) | VS + SAM | Error Rate | 53.4 | #38 |
| Long-tail Learning | CIFAR-10-LT (ρ=100) | VS + SAM | Error Rate | 17.6 | #15 |
