16 Apr 2019 • Basura Fernando, Cheston Tan Yin Chet, Hakan Bilen
Detecting the temporal extents of human actions in videos is a challenging computer vision problem that requires detailed manual supervision, including frame-level labels.