Search Results for author: Jingxu Yang

Found 2 papers, 2 papers with code

Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models

1 code implementation. CVPR 2024. Authors: Zhang Li, Biao Yang, Qiang Liu, Zhiyin Ma, Shuo Zhang, Jingxu Yang, Yabo Sun, Yuliang Liu, Xiang Bai

Additionally, experiments on 18 datasets further demonstrate that Monkey surpasses existing LMMs in many tasks like Image Captioning and various Visual Question Answering formats.

Tasks: Image Captioning, Question Answering, and 2 more
