DoubleMatch: Improving Semi-Supervised Learning with Self-Supervision

11 May 2022  ·  Erik Wallin, Lennart Svensson, Fredrik Kahl, Lars Hammarstrand ·

Following the success of supervised learning, semi-supervised learning (SSL) is now becoming increasingly popular. SSL is a family of methods, which in addition to a labeled training set, also use a sizable collection of unlabeled data for fitting a model. Most of the recent successful SSL methods are based on pseudo-labeling approaches: letting confident model predictions act as training labels. While these methods have shown impressive results on many benchmark datasets, a drawback of this approach is that not all unlabeled data are used during training. We propose a new SSL algorithm, DoubleMatch, which combines the pseudo-labeling technique with a self-supervised loss, enabling the model to utilize all unlabeled data in the training process. We show that this method achieves state-of-the-art accuracies on multiple benchmark datasets while also reducing training times compared to existing SSL methods. Code is available at https://github.com/walline/doublematch.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Semi-Supervised Image Classification cifar-100, 10000 Labels DoubleMatch Percentage error 21.22± 0.17 # 5
Semi-Supervised Image Classification CIFAR-100, 2500 Labels DoubleMatch Percentage error 27.07± 0.26 # 9
Semi-Supervised Image Classification CIFAR-100, 400 Labels DoubleMatch Percentage error 41.83± 1.22 # 11
Semi-Supervised Image Classification CIFAR-10, 250 Labels DoubleMatch Percentage error 5.56±0.42 # 14
Semi-Supervised Image Classification CIFAR-10, 4000 Labels DoubleMatch Percentage error 4.65±0.17 # 16
Semi-Supervised Image Classification CIFAR-10, 40 Labels DoubleMatch Percentage error 13.59±5.60 # 16
Semi-Supervised Image Classification STL-10, 1000 Labels DoubleMatch Accuracy 95.65±0.20 # 1
Semi-Supervised Image Classification SVHN, 1000 labels DoubleMatch Accuracy 97.90 ± 0.07 # 2
Semi-Supervised Image Classification SVHN, 250 Labels DoubleMatch Accuracy 97.63±0.35 # 3
Semi-Supervised Image Classification SVHN, 40 Labels DoubleMatch Percentage error 15.37±11.81 # 5

Methods


No methods listed for this paper. Add relevant methods here