1 code implementation • 18 Nov 2024 • Haoxing Chen, Zizheng Huang, Yan Hong, Yanshuo Wang, Zhongcai Lyu, Zhuoer Xu, Jun Lan, Zhangxuan Gu
Pre-trained vision-language models provide a robust foundation for efficient transfer learning across various downstream tasks.
1 code implementation • 30 Aug 2024 • Zizheng Huang, Haoxing Chen, Jiaqi Li, Jun Lan, Huijia Zhu, Weiqiang Wang, LiMin Wang
Recent Vision Mamba models not only have much lower complexity for processing higher-resolution images and longer videos but also achieve performance competitive with Vision Transformers (ViTs).
1 code implementation • 30 May 2024 • Haoxing Chen, Yan Hong, Zizheng Huang, Zhuoer Xu, Zhangxuan Gu, Yaohui Li, Jun Lan, Huijia Zhu, Jianfu Zhang, Weiqiang Wang, Huaxiong Li
We believe that the GenVideo dataset and the DeMamba module will significantly advance the field of AI-generated video detection.
1 code implementation • 15 Apr 2024 • Haoxing Chen, Yaohui Li, Zizheng Huang, Yan Hong, Zhuoer Xu, Zhangxuan Gu, Jun Lan, Huijia Zhu, Weiqiang Wang
Recent advancements in efficient transfer learning (ETL) have shown remarkable success in fine-tuning VLMs in limited-data scenarios, introducing only a few parameters to harness task-specific insights from VLMs.
1 code implementation • 21 Nov 2023 • Haoxing Chen, Yaohui Li, Yan Hong, Zizheng Huang, Zhuoer Xu, Zhangxuan Gu, Jun Lan, Huijia Zhu, Weiqiang Wang
Recent methods mainly focus on learning multi-modal features aligned with class names to enhance the generalization ability to unseen categories.
Ranked #1 on GZSL Video Classification on ActivityNet-GZSL (cls)
1 code implementation • 16 Jul 2022 • Zizheng Huang, Haoxing Chen, Ziqi Wen, Chao Zhang, Huaxiong Li, Bo Wang, Chunlin Chen
Contrastive learning (CL) continuously achieves significant breakthroughs across multiple domains.