no code implementations • 16 May 2023 • Yi Huang, Asim Kadav, Farley Lai, Deep Patel, Hans Peter Graf
Specifically, KeyNet introduces the use of object based keypoint information to capture context in the scene.
1 code implementation • 11 Dec 2021 • Honglu Zhou, Asim Kadav, Aviv Shamsian, Shijie Geng, Farley Lai, Long Zhao, Ting Liu, Mubbasir Kapadia, Hans Peter Graf
Group Activity Recognition detects the activity collectively performed by a group of actors, which requires compositional reasoning of actors and objects.
Ranked #2 on
Group Activity Recognition
on Collective Activity
1 code implementation • ICLR 2021 • Honglu Zhou, Asim Kadav, Farley Lai, Alexandru Niculescu-Mizil, Martin Renqiang Min, Mubbasir Kapadia, Hans Peter Graf
We evaluate over CATER dataset and find that Hopper achieves 73. 2% Top-1 accuracy using just 1 FPS by hopping through just a few critical frames.
Ranked #5 on
Video Object Tracking
on CATER
no code implementations • CVPR 2020 • Yizhe Zhu, Martin Renqiang Min, Asim Kadav, Hans Peter Graf
We propose a sequential variational autoencoder to learn disentangled representations of sequential data (e. g., videos and audios) under self-supervision.
no code implementations • CVPR 2020 • Michael Snower, Asim Kadav, Farley Lai, Hans Peter Graf
Keypoints are tracked using our Pose Entailment method, in which, first, a pair of pose estimates is sampled from different frames in a video and tokenized.
Ranked #2 on
Pose Tracking
on PoseTrack2017
no code implementations • 22 Apr 2019 • Meera Hahn, Asim Kadav, James M. Rehg, Hans Peter Graf
Localizing moments in untrimmed videos via language queries is a new and interesting task that requires the ability to accurately ground language into video.
no code implementations • 16 Nov 2017 • Chih-Yao Ma, Asim Kadav, Iain Melvin, Zsolt Kira, Ghassan AlRegib, Hans Peter Graf
We address the problem of video captioning by grounding language generation on object interactions in the video.
no code implementations • CVPR 2018 • Chih-Yao Ma, Asim Kadav, Iain Melvin, Zsolt Kira, Ghassan AlRegib, Hans Peter Graf
Human actions often involve complex interactions across several inter-related objects in the scene.
21 code implementations • 31 Aug 2016 • Hao Li, Asim Kadav, Igor Durdanovic, Hanan Samet, Hans Peter Graf
However, magnitude-based pruning of weights reduces a significant number of parameters from the fully connected layers and may not adequately reduce the computation costs in the convolutional layers due to irregular sparsity in the pruned networks.
Ranked #1 on
Network Pruning
on ImageNet