no code implementations • 4 Oct 2012 • Hema Swetha Koppula, Rudhir Gupta, Ashutosh Saxena
Given a RGB-D video, we jointly model the human activities and object affordances as a Markov random field where the nodes represent objects and sub-activities, and the edges represent the relationships between object affordances, their relations with sub-activities, and their evolution over time.
Ranked #3 on Skeleton Based Action Recognition on CAD-120