CVPR 2015 • Chuang Gan, Naiyan Wang, Yi Yang, Dit-yan Yeung, Alex G. Hauptmann
Taking key frames of videos as input, we first detect the event of interest at the video level by aggregating the CNN features of the key frames.
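
The aggregation step can be illustrated with a minimal sketch. The abstract does not say which aggregation operator is used, so mean pooling is assumed here purely for illustration, and the logistic scoring function is likewise a stand-in. The names `aggregate_key_frames` and `event_score`, the weights, and the toy feature values are all hypothetical and not taken from the paper.

```python
from math import exp
from typing import List, Sequence


def aggregate_key_frames(frame_features: Sequence[Sequence[float]]) -> List[float]:
    """Mean-pool per-frame feature vectors into one video-level vector.

    Mean pooling is an assumption; the source only says the key-frame
    CNN features are "aggregated".
    """
    if not frame_features:
        raise ValueError("need at least one key frame")
    dim = len(frame_features[0])
    if any(len(f) != dim for f in frame_features):
        raise ValueError("all frame features must have the same dimension")
    n = len(frame_features)
    return [sum(f[d] for f in frame_features) / n for d in range(dim)]


def event_score(video_feature: Sequence[float],
                weights: Sequence[float],
                bias: float = 0.0) -> float:
    """Logistic score from a hypothetical linear event classifier."""
    z = sum(w * x for w, x in zip(weights, video_feature)) + bias
    return 1.0 / (1.0 + exp(-z))


# Toy stand-ins for the CNN features of three key frames (made-up values).
frames = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]]
video_feature = aggregate_key_frames(frames)
score = event_score(video_feature, weights=[1.0, -1.0])
```

In a real pipeline the per-frame vectors would come from a CNN applied to each key frame, and the video-level score would be thresholded or ranked to decide whether the event of interest is present.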