Object Level Visual Reasoning in Videos

ECCV 2018 Fabien BaradelNatalia NeverovaChristian WolfJulien MilleGreg Mori

Human activity recognition is typically addressed by detecting key concepts like global and local motion, features related to object classes present in the scene, as well as features related to the global context. The next open challenges in activity recognition require a level of understanding that pushes beyond this and call for models with capabilities for fine distinction and detailed comprehension of interactions between actors and objects in a scene... (read more)

PDF Abstract
TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK RESULT LEADERBOARD
Semantic Object Interaction Classification VLOG Object Relation Network MAP 44.7 # 1