Partially View-aligned Representation Learning with Noise-robust Contrastive Loss

In real-world applications, it is common that only a portion of data is aligned across views due to spatial, temporal, or spatiotemporal asynchronism, thus leading to socalled Partially View-aligned Problem (PVP). To solve such a less-touched problem without the help of labels, we propose simultaneously learning representation and aligning data using a noise-robust contrastive loss. In brief, for each sample from one view, our method aims to identify its within-category counterparts from other views, and thus the cross-view correspondence could be established. As the contrastive learning needs data pairs as input, we construct positive pairs using the known correspondences and negative pairs using random sampling. To alleviate or even eliminate the influence of the false negatives caused by random sampling, we propose a noise-robust contrastive loss which could adaptively prevent the false negatives from dominating the network optimization. Different from the Hungarian algorithm and its variants, our solution to PVP aims to achieve category- instead of instance-level alignment. Thanks to the higher accessibility and scalability of the category-level alignment, it is more desirable for the tasks such as clustering and classification. In addition, to the best of our knowledge, this could be the first successful attempt of enabling contrastive learning robust to noisy labels. Extensive experiments show the promising performance of our method comparing with 10 state-of-the-art multi-view approaches in the clustering and classification tasks.

PDF CVPR 2021 PDF CVPR 2021 Abstract
Task Dataset Model Metric Name Metric Value Global Rank Benchmark
Partially View-aligned Multi-view Learning Caltech101 MvCLN NMI 43.07 # 1
Partially View-aligned Multi-view Learning n-MNIST MvCLN NMI 93.09 # 1
Partially View-aligned Multi-view Learning Reuters En-Fr MvCLN NMI 30.65 # 1
Partially View-aligned Multi-view Learning Scene-15 MvCLN NMI 39.90 # 1