no code implementations • 19 Apr 2022 • Aoi Otani, Ryota Hashiguchi, Kazuki Omi, Norishige Fukushima, Toru Tamaki
In general, action recognition models are trained on high-quality videos, hence it is not known how the model performance degrades when tested on low-quality videos, and how much the quality of training videos affects the performance.
no code implementations • 1 Apr 2022 • Ryota Hashiguchi, Toru Tamaki
We also investigate other variants in which the proposed MSCA is used along the patch dimension of ViT, instead of the head dimension.