Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

23 Feb 2016Ranjay KrishnaYuke ZhuOliver GrothJustin JohnsonKenji HataJoshua KravitzStephanie ChenYannis KalantidisLi-Jia LiDavid A. ShammaMichael S. BernsteinFei-Fei Li

Despite progress in perceptual tasks such as image classification, computers still perform poorly on cognitive tasks such as image description and question answering. Cognition is core to tasks that involve not just recognizing, but reasoning about our visual world... (read more)

PDF Abstract


No code implementations yet. Submit your code now

Results from the Paper

  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.