no code implementations • 4 Dec 2019 • Jonathan C. Stroud, Ryan McCaffrey, Rada Mihalcea, Jia Deng, Olga Russakovsky
Temporal grounding entails establishing a correspondence between natural language event descriptions and their visual depictions.
Visual Grounding