The Drive&Act dataset is a state-of-the-art multimodal benchmark for driver behavior recognition. It includes 3D skeletons in addition to frame-wise hierarchical labels for 9.6 million frames captured from 6 different views and in 3 modalities (RGB, IR, and depth).
It offers the following key features:
12h of video data in 29 long sequences
Calibrated multi-view camera system with 5 views
Multimodal videos: NIR, depth, and color data
Markerless motion capture: 3D body pose and head pose