4 code implementations • NeurIPS 2019 • Simao Herdade, Armin Kappeler, Kofi Boakye, Joao Soares
Image captioning models typically follow an encoder-decoder architecture which uses abstract image feature vectors as input to the encoder.
Image Captioning Object