AutoMix: Unveiling the Power of Mixup for Stronger Classifiers

24 Mar 2021  ·  Zicheng Liu, Siyuan Li, Di wu, Zihan Liu, ZhiYuan Chen, Lirong Wu, Stan Z. Li ·

Data mixing augmentation have proved to be effective in improving the generalization ability of deep neural networks. While early methods mix samples by hand-crafted policies (e.g., linear interpolation), recent methods utilize saliency information to match the mixed samples and labels via complex offline optimization. However, there arises a trade-off between precise mixing policies and optimization complexity. To address this challenge, we propose a novel automatic mixup (AutoMix) framework, where the mixup policy is parameterized and serves the ultimate classification goal directly. Specifically, AutoMix reformulates the mixup classification into two sub-tasks (i.e., mixed sample generation and mixup classification) with corresponding sub-networks and solves them in a bi-level optimization framework. For the generation, a learnable lightweight mixup generator, Mix Block, is designed to generate mixed samples by modeling patch-wise relationships under the direct supervision of the corresponding mixed labels. To prevent the degradation and instability of bi-level optimization, we further introduce a momentum pipeline to train AutoMix in an end-to-end manner. Extensive experiments on nine image benchmarks prove the superiority of AutoMix compared with state-of-the-art in various classification scenarios and downstream tasks.

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Result Benchmark
Image Classification CIFAR-10 ResNeXt-50 (AutoMix) Percentage correct 97.84 # 62
Image Classification CIFAR-100 ResNeXt-50(32x4d) + AutoMix Percentage correct 83.64 # 85
Image Classification CIFAR-100 WRN-28-8 +AutoMix Percentage correct 85.16 # 68
Image Classification ImageNet ResNet-50 (AutoMix) Top 1 Accuracy 79.25% # 708
Number of params 25.6M # 601
Image Classification ImageNet ResNet-101 (AutoMix) Top 1 Accuracy 80.98% # 616
Number of params 44.6M # 703
Image Classification ImageNet ResNet-34 (AutoMix) Top 1 Accuracy 76.1% # 855
Number of params 21.8M # 553
Image Classification ImageNet ResNet-18 (AutoMix) Top 1 Accuracy 72.05% # 928
Number of params 11.7M # 492
Image Classification iNaturalist 2018 ResNet-50 (AutoMix) Top-1 Accuracy 64.73% # 44
Image Classification iNaturalist 2018 ResNeXt-101 (AutoMix) Top-1 Accuracy 70.49% # 32
Image Classification Places205 AutoMix (ResNet-50 Supervised) Top 1 Accuracy 64.1 # 8
Image Classification Tiny ImageNet Classification ResNet18 (AutoMix) Validation Acc 67.33% # 19
Image Classification Tiny ImageNet Classification ResNeXt-50 (AutoMix) Validation Acc 70.72% # 14

Methods