The instability in GAN training has been a long-standing problem despite remarkable research efforts.
In this work, we propose a framework called InterFaceGAN to interpret the disentangled face representation learned by the state-of-the-art GAN models and study the properties of the facial semantics encoded in the latent space.
Although existing models can generate realistic target images, it's difficult to maintain the structure of the source image.
In addition, the lack of high-quality paired data remains an obstacle for both methods.
To address this problem, we propose a controllable face synthesis model (CFSM) that can mimic the distribution of target datasets in a style latent space.
Image generation has been a long sought-after but challenging task, and performing the generation task in an efficient manner is similarly difficult.
The capacity to recognize faces under varied poses is a fundamental human ability that presents a unique challenge for computer vision systems.
In this paper, we present the Surrey Face Model, a multi-resolution 3D Morphable Model that we make available to the public for non-commercial purposes.
Second, to enforce that different generators capture diverse high probability modes, the discriminator of MAD-GAN is designed such that along with finding the real and fake samples, it is also required to identify the generator that generated the given fake sample.
In this paper, we take a different approach, where we formulate the original problem as a stage-wise learning problem.