Adaptive Methods for Aggregated Domain Generalization

9 Dec 2021  ·  Xavier Thomas, Dhruv Mahajan, Alex Pentland, Abhimanyu Dubey ·

Domain generalization involves learning a classifier from a heterogeneous collection of training sources such that it generalizes to data drawn from similar unknown target domains, with applications in large-scale learning and personalized inference. In many settings, privacy concerns prohibit obtaining domain labels for the training data samples, and instead only have an aggregated collection of training points. Existing approaches that utilize domain labels to create domain-invariant feature representations are inapplicable in this setting, requiring alternative approaches to learn generalizable classifiers. In this paper, we propose a domain-adaptive approach to this problem, which operates in two steps: (a) we cluster training data within a carefully chosen feature space to create pseudo-domains, and (b) using these pseudo-domains we learn a domain-adaptive classifier that makes predictions using information about both the input and the pseudo-domain it belongs to. Our approach achieves state-of-the-art performance on a variety of domain generalization benchmarks without using domain labels whatsoever. Furthermore, we provide novel theoretical guarantees on domain generalization using cluster information. Our approach is amenable to ensemble-based methods and provides substantial gains even on large-scale benchmark datasets. The code can be found at: https://github.com/xavierohan/AdaClust_DomainBed

PDF Abstract

Results from the Paper


Task Dataset Model Metric Name Metric Value Global Rank Result Benchmark
Domain Generalization DomainNet AdaClust (ResNet-50) Average Accuracy 43.3 # 26
Domain Generalization DomainNet AdaClust (ResNet-50, SWAD) Average Accuracy 46.7 # 19
Domain Generalization Office-Home AdaClust (ResNet-50, SWAD) Average Accuracy 69.4 # 27
Domain Generalization Office-Home AdaClust (ResNet-50) Average Accuracy 67.7 # 30
Domain Generalization PACS AdaClust (ResNet-50) Average Accuracy 87.0 # 32
Domain Generalization PACS AdaClust (ResNet-50, SWAD) Average Accuracy 89.2 # 16
Domain Generalization TerraIncognita AdaClust (ResNet-50, SWAD) Average Accuracy 50.6 # 16
Domain Generalization TerraIncognita AdaClust (ResNet-50) Average Accuracy 48.1 # 21
Domain Generalization VLCS AdaClust (ResNet-50) Average Accuracy 78.9 # 25
Domain Generalization VLCS AdaClust (ResNet-50, SWAD) Average Accuracy 79.6 # 18

Methods


No methods listed for this paper. Add relevant methods here