Factual Visual Question Answering
1 paper with code • 0 benchmarks • 2 datasets
Benchmarks
These leaderboards are used to track progress in Factual Visual Question Answering.
Latest papers with no code
A survey on knowledge-enhanced multimodal learning
Multimodal learning has been a field of increasing interest, aiming to combine various modalities in a single joint representation.
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering
Given a question-image pair, deep network techniques have been employed to successively reduce the large set of facts until one of the two entities of the final remaining fact is predicted as the answer.
Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering
Question answering is an important task for autonomous agents and virtual assistants alike and was shown to support the disabled in efficiently navigating an overwhelming environment.
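The "Out of the Box" entry above describes answering by successively reducing a large set of facts until one entity of the final remaining fact is predicted as the answer. The following is a minimal sketch of that idea only, not the method of any listed paper: the `Fact` type, the function `answer_from_facts`, the toy facts, and the hand-supplied visual concepts and relation are all hypothetical stand-ins for what learned networks would predict.

```python
from typing import NamedTuple, Optional


class Fact(NamedTuple):
    """A hypothetical knowledge-base fact linking two entities via a relation."""
    subject: str
    relation: str
    obj: str


def answer_from_facts(facts, visual_concepts, predicted_relation, question_words) -> Optional[str]:
    """Toy fact reduction: filter facts in stages, then return one entity."""
    # Stage 1: keep facts that mention a concept assumed to be detected in the image.
    candidates = [f for f in facts
                  if f.subject in visual_concepts or f.obj in visual_concepts]
    # Stage 2: keep facts whose relation matches the one assumed to be
    # predicted from the question.
    candidates = [f for f in candidates if f.relation == predicted_relation]
    if not candidates:
        return None

    # Stage 3: pick the single remaining fact with the most word overlap
    # with the question (a crude stand-in for a learned scoring model).
    def overlap(fact):
        words = set(fact.subject.split()) | set(fact.obj.split())
        return len(words & question_words)

    best = max(candidates, key=overlap)
    # The answer is one of the two entities of the final fact: here, the one
    # that is not the image-grounded subject.
    return best.obj if best.subject in visual_concepts else best.subject


# Toy data, invented for illustration.
facts = [
    Fact("dog", "CapableOf", "bark"),
    Fact("cat", "CapableOf", "climb trees"),
    Fact("dog", "IsA", "animal"),
    Fact("frisbee", "UsedFor", "playing"),
]
visual_concepts = {"dog", "frisbee"}
question_words = {"what", "can", "the", "dog", "do"}

print(answer_from_facts(facts, visual_concepts, "CapableOf", question_words))
```

In this sketch each stage only shrinks the candidate list, so the final answer is always an entity from a fact that survived every filter; a real system would replace the hard-coded inputs and the overlap score with model predictions.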