1 code implementation • 5 Feb 2024 • Yang Jin, Zhicheng Sun, Kun Xu, Liwei Chen, Hao Jiang, Quzhe Huang, Chengru Song, Yuliang Liu, Di Zhang, Yang song, Kun Gai, Yadong Mu
In light of recent advances in multimodal Large Language Models (LLMs), there is increasing attention to scaling them from image-text data to more informative real-world videos.
Ranked #63 on Visual Question Answering on MM-Vet
1 code implementation • CVPR 2023 • Zhicheng Sun, Yadong Mu, Gang Hua
Continual learning aims to learn on non-stationary data streams without catastrophically forgetting previous knowledge.
1 code implementation • ACM Multimedia 2022 • Zhicheng Sun, Yadong Mu
The task of lifelong person re-identification aims to match a person across multiple cameras given continuous data streams.