no code implementations • 1 Dec 2022 • Sidra Hanif, Longin Jan Latecki
We propose aggregating features from pretrained image and text embeddings to learn abstract visual concepts from GCD.
Image Captioning • Text-to-Image Generation