Toward Understanding Supervised Representation Learning with RKHS and GAN

1 Jan 2021  ·  Xu Liao, Jin Liu, Tianwen Wen, Yuling Jiao, Jian Huang ·

The success of deep supervised learning depends on its automatic data representation abilities. A good representation of high-dimensional complex data should enjoy low-dimensionally and disentanglement while losing as little information as possible. In this work, we give a statistical understanding of how deep representation goals can be achieved with reproducing kernel Hilbert spaces (RKHS) and generative adversarial networks (GAN). At the population level, we formulate the ideal representation learning task as that of finding a nonlinear map that minimizes the sum of losses characterizing conditional independence (with RKHS) and disentanglement (with GAN). We estimate the target map at the sample level nonparametrically with deep neural networks. We prove the consistency in terms of the population objective function value. We validate the proposed methods via comprehensive numerical experiments and real data analysis in the context of regression and classification. The resulting prediction accuracies are better than state-of-the-art methods.

PDF Abstract
Task Dataset Model Metric Name Metric Value Global Rank Uses Extra
Training Data
Benchmark
Image Classification Kuzushiji-MNIST NSRL (WGAN) (d=32) Accuracy 98.72 # 9
Image Classification Kuzushiji-MNIST NSRL (WGAN) (d=16) Accuracy 98.66 # 11
Image Classification Kuzushiji-MNIST NSRL (WGAN) (d=8) Accuracy 98.68 # 10
Image Classification Kuzushiji-MNIST NSRL (log D) (d=32) Accuracy 98.63 # 12
Image Classification Kuzushiji-MNIST NSRL (log D) (d=16) Accuracy 98.81 # 5
Image Classification Kuzushiji-MNIST NSRL (log D) (d=8) Accuracy 98.61 # 13
Image Classification Kuzushiji-MNIST CN(d=32) Accuracy 98.84 # 4
Image Classification Kuzushiji-MNIST CN(d=16) Accuracy 98.80 # 6
Image Classification Kuzushiji-MNIST CN(d=8) Accuracy 98.60 # 14
Image Classification STL-10 CN(d=32) Percentage correct 98.17 # 10
Image Classification STL-10 NSRL+CN(d=64) Percentage correct 98.24 # 9
Image Classification STL-10 NSRL+CN(d=32) Percentage correct 98.34 # 8
Image Classification STL-10 NSRL+CN(d=128) Percentage correct 98.36 # 6
Image Classification STL-10 CN(d=64) Percentage correct 98.36 # 6
Image Classification STL-10 CN(d=128) Percentage correct 98.45 # 5

Methods


No methods listed for this paper. Add relevant methods here