Search Results for author: Gilad Vered

Joint Optimization for Cooperative Image Captioning

This can be achieved by training two networks: a "speaker" that generates sentences given an image and a "listener" that uses them to perform a task.

Second, we show that the generated descriptions can be kept close to natural by constraining them to be similar to human descriptions.
