no code implementations • 18 Apr 2024 • Xunsong Li, Pengzhan Sun, Yangcen Liu, Lixin Duan, Wen Li
Existing methods usually adopt a two-stage pipeline: object proposals are first detected using a pretrained detector and then fed to an action recognition model, which extracts video features and learns the object relations for action recognition.
1 code implementation • ICCV 2023 • Eslam Mohamed BAKR, Pengzhan Sun, Xiaoqian Shen, Faizan Farooq Khan, Li Erran Li, Mohamed Elhoseiny
We conducted a human evaluation to probe the effectiveness of HRS-Bench; on average, it aligned with 95% of our evaluations.
no code implementations • 10 Apr 2023 • Eslam Mohamed BAKR, Pengzhan Sun, Li Erran Li, Mohamed Elhoseiny
In addition, we formulate the measurement of bias in generated captions as prompt-based image captioning, instead of using language classifiers.
1 code implementation • ACM International Conference on Multimedia 2021 • Pengzhan Sun, Bo Wu, Xunsong Li, Wen Li, Lixin Duan, Chuang Gan
By doing that, our proposed CDN method can better recognize unseen action instances by debiasing the effect of appearances.