Visual-Textual Capsule Routing for Text-Based Video Segmentation

CVPR 2020 Bruce McIntosh Kevin Duarte Yogesh S Rawat Mubarak Shah

Joint understanding of vision and natural language is a challenging problem with a wide range of applications in artificial intelligence. In this work, we focus on integration of video and text for the task of actor and action video segmentation from a sentence... (read more)

PDF Abstract

Results from the Paper


  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods used in the Paper