Scaling Up Semi-supervised Learning with Unconstrained Unlabelled Data

2 Jun 2023  ยท  Shuvendu Roy, Ali Etemad ยท

We propose UnMixMatch, a semi-supervised learning framework which can learn effective representations from unconstrained unlabelled data in order to scale up performance. Most existing semi-supervised methods rely on the assumption that labelled and unlabelled samples are drawn from the same distribution, which limits the potential for improvement through the use of free-living unlabeled data. Consequently, the generalizability and scalability of semi-supervised learning are often hindered by this assumption. Our method aims to overcome these constraints and effectively utilize unconstrained unlabelled data in semi-supervised learning. UnMixMatch consists of three main components: a supervised learner with hard augmentations that provides strong regularization, a contrastive consistency regularizer to learn underlying representations from the unlabelled data, and a self-supervised loss to enhance the representations that are learnt from the unlabelled data. We perform extensive experiments on 4 commonly used datasets and demonstrate superior performance over existing semi-supervised methods with a performance boost of 4.79%. Extensive ablation and sensitivity studies show the effectiveness and impact of each of the proposed components of our method.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Semi-Supervised Image Classification CIFAR-100 (10000 Labels, ImageNet-100 Unlabeled) UnMixMatch Accuracy 71.73 # 1
Semi-Supervised Image Classification CIFAR-100 (250 Labels, ImageNet-100 Unlabeled) UnMixMatch Accuarcy 54.18 # 2
Semi-Supervised Image Classification CIFAR-100 (400 Labels, ImageNet-100 Unlabeled) UnMixMatch Accuracy 26.13 # 1
Semi-Supervised Image Classification CIFAR-10, 100 Labels (OpenSet, 6/4) UnMixMatch Accuracy 96.8 # 1
Semi-Supervised Image Classification CIFAR-10 (250 Labels, ImageNet-100 Unlabeled) UnMixMatch Accuracy 68.72 # 1
Semi-Supervised Image Classification CIFAR-10 (4000 Labels, ImageNet-100 Unlabeled) UnMixMatch Accuracy 89.58 # 1
Semi-Supervised Image Classification CIFAR-10, 400 Labels (OpenSet, 6/4) UnMixMatch Accuracy 97.2 # 1
Image Classification CIFAR-10 (40 Labels, ImageNet-100 Unlabeled) UnMixMatch Accuarcy 52.07 # 1
Semi-Supervised Image Classification CIFAR-10, 50 Labels (OpenSet, 6/4) UnMixMatch Accuracy 95.7 # 1
Semi-Supervised Image Classification STL-10 (1000 Labels, ImageNet-100 Unlabeled) UnMixMatch Accuracy 84.73 # 1
Semi-Supervised Image Classification SVHN (1000 Labels, ImageNet-100 Unlabeled) UnMixMatch Accuracy 91.03 # 1
Semi-Supervised Image Classification SVHN (250 Labels, ImageNet-100 Unlabeled) UnMixMatch Accuracy 80.78 # 1
Semi-Supervised Image Classification SVHN (40 Labels, ImageNet-100 Unlabeled) UnMixMatch Accuracy 72.9 # 1

Methods


No methods listed for this paper. Add relevant methods here