Neural Self Talk: Image Understanding via Continuous Questioning and Answering

In this paper we consider the problem of continuously discovering image contents by actively asking image based questions and subsequently answering the questions being asked. The key components include a Visual Question Generation (VQG) module and a Visual Question Answering module, in which Recurrent Neural Networks (RNN) and Convolutional Neural Network (CNN) are used... (read more)

PDF Abstract
No code implementations yet. Submit your code now
TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK RESULT BENCHMARK
Question Generation COCO Visual Question Answering (VQA) real images 1.0 open ended Max(Yang,2015) BLEU-1 59.4 # 3
Question Generation COCO Visual Question Answering (VQA) real images 1.0 open ended Sample(Yang,2015) BLEU-1 38.8 # 4

Methods used in the Paper


METHOD TYPE
🤖 No Methods Found Help the community by adding them if they're not listed; e.g. Deep Residual Learning for Image Recognition uses ResNet