Videos as Space-Time Region Graphs

ECCV 2018 Xiaolong WangAbhinav Gupta

How do humans recognize the action "opening a book" ? We argue that there are two important cues: modeling temporal shape dynamics and modeling functional relationships between humans and objects... (read more)

PDF Abstract

Results from the Paper


#7 best model for Action Classification on Charades (using extra training data)

     Get a GitHub badge
TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK USES EXTRA
TRAINING DATA
RESULT LEADERBOARD
Action Classification Charades STRG MAP 39.7 # 7