Search Results for author: Zizheng Huang

Found 6 papers, 6 papers with code

Efficient Transfer Learning for Video-language Foundation Models

1 code implementation18 Nov 2024 Haoxing Chen, Zizheng Huang, Yan Hong, Yanshuo Wang, Zhongcai Lyu, Zhuoer Xu, Jun Lan, Zhangxuan Gu

Pre-trained vision-language models provide a robust foundation for efficient transfer learning across various downstream tasks.

Action Recognition Few-Shot Learning +3

Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training

1 code implementation30 Aug 2024 Zizheng Huang, Haoxing Chen, Jiaqi Li, Jun Lan, Huijia Zhu, Weiqiang Wang, LiMin Wang

Recent Vision Mamba models not only have much lower complexity for processing higher resolution images and longer videos but also the competitive performance with Vision Transformers (ViTs).

Image Classification Mamba +2

DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark

1 code implementation30 May 2024 Haoxing Chen, Yan Hong, Zizheng Huang, Zhuoer Xu, Zhangxuan Gu, Yaohui Li, Jun Lan, Huijia Zhu, Jianfu Zhang, Weiqiang Wang, Huaxiong Li

We believe that the GenVideo dataset and the DeMamba module will significantly advance the field of AI-generated video detection.

DeepFake Detection Mamba +4

Conditional Prototype Rectification Prompt Learning

1 code implementation15 Apr 2024 Haoxing Chen, Yaohui Li, Zizheng Huang, Yan Hong, Zhuoer Xu, Zhangxuan Gu, Jun Lan, Huijia Zhu, Weiqiang Wang

Recent advancements in efficient transfer learning (ETL) have shown remarkable success in fine-tuning VLMs within the scenario of limited data, introducing only a few parameters to harness task-specific insights from VLMs.

Few-Shot Learning Transfer Learning

Boosting Audio-visual Zero-shot Learning with Large Language Models

1 code implementation21 Nov 2023 Haoxing Chen, Yaohui Li, Yan Hong, Zizheng Huang, Zhuoer Xu, Zhangxuan Gu, Jun Lan, Huijia Zhu, Weiqiang Wang

Recent methods mainly focus on learning multi-modal features aligned with class names to enhance the generalization ability to unseen categories.

audio-visual learning Descriptive +1

Cannot find the paper you are looking for? You can Submit a new open access paper.