TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Visual Question Answering (VQA)	COCO Visual Question Answering (VQA) real images 1.0 multiple choice	iBOWIMG baseline	Percentage correct	62.0	# 10
Visual Question Answering (VQA)	COCO Visual Question Answering (VQA) real images 1.0 open ended	iBOWIMG baseline	Percentage correct	55.9	# 14

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/simple-baseline-for-visual-question-answering/visual-question-answering-on-coco-visual-1)](https://paperswithcode.com/sota/visual-question-answering-on-coco-visual-1?p=simple-baseline-for-visual-question-answering)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/simple-baseline-for-visual-question-answering/visual-question-answering-on-coco-visual-4)](https://paperswithcode.com/sota/visual-question-answering-on-coco-visual-4?p=simple-baseline-for-visual-question-answering)`

Simple Baseline for Visual Question Answering

7 Dec 2015 · Bolei Zhou, Yuandong Tian, Sainbayar Sukhbaatar, Arthur Szlam, Rob Fergus ·

We describe a very simple bag-of-words baseline for visual question answering. This baseline concatenates the word features from the question and CNN features from the image to predict the answer. When evaluated on the challenging VQA dataset [2], it shows comparable performance to many recent approaches using recurrent neural networks. To explore the strength and weakness of the trained model, we also provide an interactive web demo and open-source code. .

PDF Abstract

Code

Add Remove Mark official

metalbubble/VQAbaseline official

186

sidaw/nbsvm

136

yikang-li/iqan

sidgan/whats_in_a_question

karunraju/VQA

See all 7 implementations

Tasks

Add Remove

Visual Question Answering

Visual Question Answering (VQA)

Datasets

MS COCO

Visual Question Answering

Results from the Paper

Edit

Ranked #10 on Visual Question Answering (VQA) on COCO Visual Question Answering (VQA) real images 1.0 multiple choice

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Result	Benchmark
Visual Question Answering (VQA)	COCO Visual Question Answering (VQA) real images 1.0 multiple choice	iBOWIMG baseline	Percentage correct	62.0	# 10		Compare
Visual Question Answering (VQA)	COCO Visual Question Answering (VQA) real images 1.0 open ended	iBOWIMG baseline	Percentage correct	55.9	# 14		Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Simple Baseline for Visual Question Answering

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove