Search Results for author: Muyang He

Found 3 papers, 3 papers with code

Efficient Multimodal Learning from Data-centric Perspective

1 code implementation • 18 Feb 2024 • Muyang He, Yexin Liu, Boya Wu, Jianhao Yuan, Yueze Wang, Tiejun Huang, Bo Zhao

Multimodal Large Language Models (MLLMs) have demonstrated notable capabilities in general visual understanding and reasoning tasks.

SVIT: Scaling up Visual Instruction Tuning

2 code implementations • 9 Jul 2023 • Bo Zhao, Boya Wu, Muyang He, Tiejun Huang

Thanks to the emergence of foundation models, large language and vision models are integrated to acquire multimodal abilities such as visual captioning and question answering.

GPT-4 Image Captioning +1

Large-scale Dataset Pruning with Dynamic Uncertainty

1 code implementation • 8 Jun 2023 • Muyang He, Shuo Yang, Tiejun Huang, Bo Zhao

The state of the art of many learning tasks, e.g., image classification, is advanced by collecting larger datasets and then training larger models on them.

Image Classification
