Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning

CVPR 2017 · Jiasen Lu, Caiming Xiong, Devi Parikh, Richard Socher

Attention-based neural encoder-decoder frameworks have been widely adopted for image captioning. Most methods force visual attention to be active for every generated word...

