# Where to put the Image in an Image Caption Generator

27 Mar 2017 · Marc Tanti, Albert Gatt, Kenneth P. Camilleri

When a recurrent neural network language model is used for caption generation, the image information can be fed to the neural network either by directly incorporating it in the RNN -- conditioning the language model by 'injecting' image features -- or in a layer following the RNN -- conditioning the language model by 'merging' image features. While both options are attested in the literature, there is as yet no systematic comparison between the two...
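To make the distinction concrete, the following is a minimal sketch of the two conditioning options, not the authors' implementation. It assumes a plain tanh (Elman) RNN with random, untrained weights, and it assumes one particular way of injecting: concatenating the image vector with each word embedding. All names and sizes (`inject_next_word`, `merge_next_word`, `VOCAB`, `EMB`, `IMG`, `HID`) are hypothetical and chosen only for illustration.

```python
import math
import random

random.seed(0)

# Hypothetical toy sizes: vocabulary, word embedding, image vector, RNN state.
VOCAB, EMB, IMG, HID = 5, 4, 3, 6

def rand_matrix(rows, cols):
    """Random weights; a real model would learn these."""
    return [[random.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

def matvec(m, v):
    return [sum(w * x for w, x in zip(row, v)) for row in m]

def rnn_step(w_in, w_rec, x, h):
    """Simple tanh RNN cell: h' = tanh(W_in x + W_rec h)."""
    a, b = matvec(w_in, x), matvec(w_rec, h)
    return [math.tanh(p + q) for p, q in zip(a, b)]

def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

embed = rand_matrix(VOCAB, EMB)  # one embedding row per word id

def inject_next_word(image, prefix, w_in, w_rec, w_out):
    """Inject: the image vector enters the RNN itself.
    Assumption: it is concatenated with every word embedding."""
    h = [0.0] * HID
    for word in prefix:
        h = rnn_step(w_in, w_rec, embed[word] + image, h)  # list concatenation
    return softmax(matvec(w_out, h))

def merge_next_word(image, prefix, w_in, w_rec, w_out):
    """Merge: the RNN sees only words; the image vector is
    concatenated with the final RNN state in the layer after it."""
    h = [0.0] * HID
    for word in prefix:
        h = rnn_step(w_in, w_rec, embed[word], h)
    return softmax(matvec(w_out, h + image))

image = [0.5, -0.2, 0.1]   # stand-in for image features
prefix = [0, 3, 1]         # word ids of the caption generated so far

inject_params = (rand_matrix(HID, EMB + IMG), rand_matrix(HID, HID),
                 rand_matrix(VOCAB, HID))
merge_params = (rand_matrix(HID, EMB), rand_matrix(HID, HID),
                rand_matrix(VOCAB, HID + IMG))

p_inject = inject_next_word(image, prefix, *inject_params)
p_merge = merge_next_word(image, prefix, *merge_params)
print(p_inject)
print(p_merge)
```

Both functions return a probability distribution over the next word. The only structural difference is where `image` appears: inside the recurrent step for inject, or after the recurrent loop for merge, which is why the merge RNN's input matrix is smaller and its state encodes the caption prefix alone.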

