Localizing Moments in Video with Natural Language

ICCV 2017 · Lisa Anne Hendricks, Oliver Wang, Eli Shechtman, Josef Sivic, Trevor Darrell, Bryan Russell

We consider retrieving a specific temporal segment, or moment, from a video given a natural language text description. Methods designed to retrieve whole video clips with natural language determine what occurs in a video but not when...

