The Q-Bench includes three realms for low-level vision: perception (A1), description (A2), and assessment (A3). - For perception (A1) /description (A2), we collect two benchmark datasets LLVisionQA/LLDescribe. - For assessment (A3), as we use public datasets, we provide an abstract evaluation code for arbitrary MLLMs for anyone to test.
Paper | Code | Results | Date | Stars |
---|