A Second-Order Approach to Learning with Instance-Dependent Label Noise

CVPR 2021  ·  Zhaowei Zhu, Tongliang Liu, Yang Liu ·

The presence of label noise often misleads the training of deep neural networks. Departing from the recent literature which largely assumes the label noise rate is only determined by the true label class, the errors in human-annotated labels are more likely to be dependent on the difficulty levels of tasks, resulting in settings with instance-dependent label noise. We first provide evidences that the heterogeneous instance-dependent label noise is effectively down-weighting the examples with higher noise rates in a non-uniform way and thus causes imbalances, rendering the strategy of directly applying methods for class-dependent label noise questionable. Built on a recent work peer loss [24], we then propose and study the potentials of a second-order approach that leverages the estimation of several covariance terms defined between the instance-dependent noise rates and the Bayes optimal label. We show that this set of second-order statistics successfully captures the induced imbalances. We further proceed to show that with the help of the estimated second-order statistics, we identify a new loss function whose expected risk of a classifier under instance-dependent label noise is equivalent to a new problem with only class-dependent label noise. This fact allows us to apply existing solutions to handle this better-studied setting. We provide an efficient procedure to estimate these second-order statistics without accessing either ground truth labels or prior knowledge of the noise rates. Experiments on CIFAR10 and CIFAR100 with synthetic instance-dependent label noise and Clothing1M with real-world human label noise verify our approach. Our implementation is available at https://github.com/UCSC-REAL/CAL.

PDF Abstract CVPR 2021 PDF CVPR 2021 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Image Classification with Label Noise CIFAR-100, 20% IDN CAL Accuracy 69.11% # 2
Image Classification with Label Noise CIFAR-100, 40% IDN CAL Accuracy 63.17% # 2
Image Classification with Label Noise CIFAR-100, 60% IDN CAL Accuracy 43.58% # 3
Image Classification with Label Noise CIFAR-10, 20% IDN CAL Accuracy 92.01% # 2
Image Classification with Label Noise CIFAR-10, 40% IDN CAL Accuracy 84.96% # 3
Image Classification with Label Noise CIFAR-10, 60% IDN CAL Accuracy 79.82% # 2
Image Classification Clothing1M CAL Accuracy 74.17% # 21

Methods


No methods listed for this paper. Add relevant methods here