SimMatch: Semi-supervised Learning with Similarity Matching

Learning with few labeled data has been a longstanding problem in the computer vision and machine learning research community. In this paper, we introduced a new semi-supervised learning framework, SimMatch, which simultaneously considers semantic similarity and instance similarity. In SimMatch, the consistency regularization will be applied on both semantic-level and instance-level. The different augmented views of the same instance are encouraged to have the same class prediction and similar similarity relationship respected to other instances. Next, we instantiated a labeled memory buffer to fully leverage the ground truth labels on instance-level and bridge the gaps between the semantic and instance similarities. Finally, we proposed the \textit{unfolding} and \textit{aggregation} operation which allows these two similarities be isomorphically transformed with each other. In this way, the semantic and instance pseudo-labels can be mutually propagated to generate more high-quality and reliable matching targets. Extensive experimental results demonstrate that SimMatch improves the performance of semi-supervised learning tasks across different benchmark datasets and different settings. Notably, with 400 epochs of training, SimMatch achieves 67.2\%, and 74.4\% Top-1 Accuracy with 1\% and 10\% labeled examples on ImageNet, which significantly outperforms the baseline methods and is better than previous semi-supervised learning frameworks. Code and pre-trained models are available at https://github.com/KyleZheng1997/simmatch.

PDF Abstract CVPR 2022 PDF CVPR 2022 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Semi-Supervised Image Classification cifar-100, 10000 Labels SimMatch Percentage error 20.58 # 3
Semi-Supervised Image Classification CIFAR-100, 2500 Labels SimMatch Percentage error 25.07 # 2
Semi-Supervised Image Classification CIFAR-100, 400 Labels SimMatch Percentage error 37.81 # 5
Semi-Supervised Image Classification CIFAR-10, 250 Labels SimMatch Percentage error 4.84 # 7
Semi-Supervised Image Classification CIFAR-10, 4000 Labels SimMatch Percentage error 3.96 # 3
Semi-Supervised Image Classification CIFAR-10, 40 Labels SimMatch Percentage error 5.6 # 7
Semi-Supervised Image Classification ImageNet - 10% labeled data SimMatch (ResNet-50) Top 1 Accuracy 74.4% # 22
Semi-Supervised Image Classification ImageNet - 1% labeled data SimMatch (ResNet-50) Top 1 Accuracy 67.2% # 19

Methods


No methods listed for this paper. Add relevant methods here