Search Results for author: Minyan Zeng

Found 1 papers, 1 papers with code

Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models

1 code implementation28 Aug 2024 Wenbin Wang, Liang Ding, Minyan Zeng, Xiabin Zhou, Li Shen, Yong Luo, DaCheng Tao

Building upon this insight, we propose Divide, Conquer and Combine (DC$^2$), a novel training-free framework for enhancing MLLM perception of HR images.

2k 4k +1

Cannot find the paper you are looking for? You can Submit a new open access paper.