Balanced Contrastive Learning for Long-Tailed Visual Recognition

Real-world data typically follow a long-tailed distribution, where a few majority categories occupy most of the data while most minority categories contain a limited number of samples. Classification models minimizing cross-entropy struggle to represent and classify the tail classes. Although the problem of learning unbiased classifiers has been well studied, methods for representing imbalanced data are under-explored. In this paper, we focus on representation learning for imbalanced data. Recently, supervised contrastive learning has shown promising performance on balanced data recently. However, through our theoretical analysis, we find that for long-tailed data, it fails to form a regular simplex which is an ideal geometric configuration for representation learning. To correct the optimization behavior of SCL and further improve the performance of long-tailed visual recognition, we propose a novel loss for balanced contrastive learning (BCL). Compared with SCL, we have two improvements in BCL: class-averaging, which balances the gradient contribution of negative classes; class-complement, which allows all classes to appear in every mini-batch. The proposed balanced contrastive learning (BCL) method satisfies the condition of forming a regular simplex and assists the optimization of cross-entropy. Equipped with BCL, the proposed two-branch framework can obtain a stronger feature representation and achieve competitive performance on long-tailed benchmark datasets such as CIFAR-10-LT, CIFAR-100-LT, ImageNet-LT, and iNaturalist2018. Our code is available at https://github.com/FlamieZhu/BCL .

PDF Abstract CVPR 2022 PDF CVPR 2022 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Long-tail Learning CIFAR-100-LT (ρ=100) BCL(ResNet-32) Error Rate 46.1 # 15
Long-tail Learning CIFAR-100-LT (ρ=50) BCL(ResNet-32) Error Rate 43.4 # 16
Long-tail Learning CIFAR-10-LT (ρ=10) BCL(ResNet-32) Error Rate 8.9 # 10
Long-tail Learning on CIFAR-10-LT (ρ=100) CIFAR-10-LT (ρ=100) BCL(ResNet-32) Error Rate 15.68 # 1
Long-tail Learning ImageNet-LT BCL(ResNeXt-50) Top-1 Accuracy 57.1 # 28
Long-tail Learning iNaturalist 2018 BCL(ResNet-50) Top-1 Accuracy 71.8% # 23

Methods