Visual Question Answering (VQA) v2.0 is a dataset containing open-ended questions about images. These questions require an understanding of vision, language and commonsense knowledge to answer. It is the second version of the VQA dataset.
The first version of the dataset was released in October 2015.
Paper | Code | Results | Date | Stars |
---|