CoDiM: Learning with Noisy Labels via Contrastive Semi-Supervised Learning

23 Nov 2021  ·  Xin Zhang, Zixuan Liu, Kaiwen Xiao, Tian Shen, Junzhou Huang, Wei Yang, Dimitris Samaras, Xiao Han

Labels are costly and sometimes unreliable. Noisy label learning, semi-supervised learning, and contrastive learning are three different strategies for designing learning processes that require less annotation cost. Semi-supervised learning and contrastive learning have recently been shown to improve learning strategies that address datasets with noisy labels. Still, the inner connections between these fields, as well as the potential to combine their strengths, have only started to emerge. In this paper, we explore further ways to fuse them and the advantages of doing so. Specifically, we propose CSSL, a unified Contrastive Semi-Supervised Learning algorithm, and CoDiM (Contrastive DivideMix), a novel algorithm for learning with noisy labels. CSSL leverages the power of classical semi-supervised and contrastive learning techniques and is further adapted into CoDiM, which learns robustly from multiple types and levels of label noise. We show that CoDiM brings consistent improvements and achieves state-of-the-art results on multiple benchmarks.
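To make the fusion of the two objectives concrete, the sketch below shows one hypothetical training step that combines a supervised cross-entropy term on samples treated as clean, a MixMatch-style pseudo-label consistency term on the remaining samples, and a SimCLR-style NT-Xent contrastive term on two augmented views. This is only a minimal illustration of the general idea of combining semi-supervised and contrastive losses; the function names, loss weights, and the prediction-sharpening scheme are assumptions and do not reproduce the exact CSSL/CoDiM formulation from the paper.

```python
import torch
import torch.nn.functional as F

def nt_xent_loss(z1, z2, temperature=0.5):
    """SimCLR-style NT-Xent contrastive loss between two augmented views."""
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    z = torch.cat([z1, z2], dim=0)                        # (2N, d)
    sim = torch.mm(z, z.t()) / temperature                # pairwise similarities
    n = z1.size(0)
    mask = torch.eye(2 * n, dtype=torch.bool, device=z.device)
    sim = sim.masked_fill(mask, float('-inf'))            # drop self-similarity
    targets = torch.cat([torch.arange(n, 2 * n), torch.arange(0, n)]).to(z.device)
    return F.cross_entropy(sim, targets)

def cssl_step(model, proj_head, batch, lambda_u=1.0, lambda_c=0.1):
    """One hypothetical step mixing supervised, pseudo-label, and contrastive
    terms. `model` returns (features, logits); `proj_head` maps features into
    the contrastive embedding space. Weights lambda_u/lambda_c are illustrative."""
    (x_l, y_l), (x_u1, x_u2) = batch                      # labeled batch / two unlabeled views
    feat_l, logits_l = model(x_l)
    feat_u1, logits_u1 = model(x_u1)
    feat_u2, logits_u2 = model(x_u2)

    # Supervised loss on samples deemed clean (e.g. selected by a per-sample
    # loss model, as in DivideMix-style sample division).
    loss_sup = F.cross_entropy(logits_l, y_l)

    # Semi-supervised consistency: average and sharpen the two predictions
    # to form a pseudo-label, then regress both views toward it.
    with torch.no_grad():
        p = (logits_u1.softmax(dim=1) + logits_u2.softmax(dim=1)) / 2
        pseudo = (p ** 2 / (p ** 2).sum(dim=1, keepdim=True)).detach()
    loss_unsup = F.mse_loss(logits_u1.softmax(dim=1), pseudo)

    # Contrastive loss pulls the two views of each image together in embedding space.
    loss_con = nt_xent_loss(proj_head(feat_u1), proj_head(feat_u2))

    return loss_sup + lambda_u * loss_unsup + lambda_c * loss_con
```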

Task                 | Dataset            | Model                            | Metric Name             | Metric Value | Global Rank
Image Classification | mini WebVision 1.0 | CoDiM-Self (Inception-ResNet-v2) | Top-1 Accuracy          | 80.12        | #9
                     |                    |                                  | Top-5 Accuracy          | 93.52        | #5
                     |                    |                                  | ImageNet Top-1 Accuracy | 77.24        | #5
                     |                    |                                  | ImageNet Top-5 Accuracy | 92.48        | #12
Image Classification | mini WebVision 1.0 | CoDiM-Sup (Inception-ResNet-v2)  | Top-1 Accuracy          | 80.88        | #5
                     |                    |                                  | Top-5 Accuracy          | 92.48        | #13
                     |                    |                                  | ImageNet Top-1 Accuracy | 76.52        | #8
                     |                    |                                  | ImageNet Top-5 Accuracy | 91.96        | #17
