You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization

15 Nov 2019Okan KöpüklüXiangyu WeiGerhard Rigoll

Spatiotemporal action localization requires incorporation of two sources of information into the designed architecture: (1) Temporal information from the previous frames and (2) spatial information from the key frame. Current state-of-the-art approaches usually extract these information with separate networks and use an extra mechanism for fusion to get detections... (read more)

PDF Abstract


No code implementations yet. Submit your code now

Evaluation Results from the Paper

  Submit results from this paper to get state-of-the-art GitHub badges and help community compare results to other papers.