1 code implementation • NeurIPS 2022 • Zelun Luo, Zane Durante, Linden Li, Wanze Xie, Ruochen Liu, Emily Jin, Zhuoyi Huang, Lun Yu Li, Jiajun Wu, Juan Carlos Niebles, Ehsan Adeli, Fei-Fei Li
Video-language models (VLMs), large models pre-trained on numerous but noisy video-text pairs from the internet, have revolutionized activity recognition through their remarkable generalization and open-vocabulary capabilities.
Ranked #2 on
Few Shot Action Recognition
on MOMA-LRG
(using extra training data)
no code implementations • NeurIPS 2021 • Zelun Luo, Wanze Xie, Siddharth Kapoor, Yiyun Liang, Michael Cooper, Juan Carlos Niebles, Ehsan Adeli, Fei-Fei Li
This paper introduces Activity Parsing as the overarching task of temporal segmentation and classification of activities, sub-activities, atomic actions, along with an instance-level understanding of actors, objects, and their relationships in videos.
1 code implementation • 15 Jan 2021 • Junshen Kevin Chen, Wanze Xie, Yutong He
In this project, we leverage a trained single-letter classifier to predict the written word from a continuously written word sequence, by designing a word reconstruction pipeline consisting of a dynamic-programming algorithm and an auto-correction model.
1 code implementation • 15 Jan 2021 • Junshen Kevin Chen, Wanze Xie, Yutong He
We attempt to overcome the restriction of requiring a writing surface for handwriting recognition.