no code implementations • 18 Apr 2024 • Xunsong Li, Pengzhan Sun, Yangcen Liu, Lixin Duan, Wen Li
Existing methods usually adopt a two-stage pipeline, where object proposals are first detected using a pretrained detector, and then are fed to an action recognition model for extracting video features and learning the object relations for action recognition.
1 code implementation • ACM International Conference on Multimedia 2021 • Pengzhan Sun, Bo Wu, Xunsong Li, Wen Li, Lixin Duan, Chuang Gan
By doing that, our proposed CDN method can better recognize unseen action instances by debiasing the effect of appearances.