…We benchmark four foundational video understanding tasks: action recognition, action segmentation, object detection and multi-object tracking.
1 PAPER • NO BENCHMARKS YET