1 code implementation • 4 Nov 2024 • Yanyi Zhang, Binglin Qiu, Qi Jia, Yu Liu, Ran He
Most incremental learners excessively prioritize coarse classes of objects while neglecting various kinds of states (e.g., color and material) attached to the objects.
no code implementations • 9 Mar 2024 • Yanyi Zhang, Qi Jia, Xin Fan, Yu Liu, Ran He
Inspired by this, we propose a novel A-O disentangled framework for CZSL, namely Class-specified Cascaded Network (CSCNet).
no code implementations • ICCV 2021 • Yanyi Zhang, Xinyu Li, Chunhui Liu, Bing Shuai, Yi Zhu, Biagio Brattoli, Hao Chen, Ivan Marsic, Joseph Tighe
We first introduce the vanilla video transformer and show that the transformer module is able to perform spatio-temporal modeling from raw pixels, but with heavy memory usage.
Ranked #15 on Action Classification on Charades
1 code implementation • CVPR 2022 • Jiaojiao Zhao, Yanyi Zhang, Xinyu Li, Hao Chen, Shuai Bing, Mingze Xu, Chunhui Liu, Kaustav Kundu, Yuanjun Xiong, Davide Modolo, Ivan Marsic, Cees G. M. Snoek, Joseph Tighe
We propose TubeR: a simple solution for spatio-temporal video action detection.
no code implementations • CVPR 2021 • Yanyi Zhang, Xinyu Li, Ivan Marsic
Multi-label activity recognition is designed for recognizing multiple activities that are performed simultaneously or sequentially in each video.
no code implementations • 6 Dec 2018 • Yanyi Zhang, Xinyu Li, Kaixiang Huang, Yehan Wang, Shuhong Chen, Ivan Marsic
We present a system for concurrent activity recognition.
no code implementations • 28 Feb 2017 • Xinyu Li, Yanyi Zhang, Jianyu Zhang, Yueyang Chen, Shuhong Chen, Yue Gu, Moliang Zhou, Richard A. Farneth, Ivan Marsic, Randall S. Burd
For the Olympic swimming dataset, our system achieved an accuracy of 88%, an F1-score of 0.58, a completeness estimation error of 6.3% and a remaining-time estimation error of 2.9 minutes.
no code implementations • 10 Feb 2017 • Xinyu Li, Yanyi Zhang, Ivan Marsic, Randall S. Burd
We introduce a novel, accurate and practical system for real-time people tracking and identification.
no code implementations • 6 Feb 2017 • Xinyu Li, Yanyi Zhang, Jianyu Zhang, Shuhong Chen, Ivan Marsic, Richard A. Farneth, Randall S. Burd
Our system is the first to address concurrent activity recognition with multisensory data using a single model, which is scalable, simple to train and easy to deploy.