Search Results for author: Muchen Lan

Found 1 papers, 0 papers with code

KNVQA: A Benchmark for evaluation knowledge-based VQA

no code implementations21 Nov 2023 Sirui Cheng, Siyu Zhang, Jiayi Wu, Muchen Lan

Within the multimodal field, large vision-language models (LVLMs) have made significant progress due to their strong perception and reasoning capabilities in the visual and language systems.

Hallucination Visual Question Answering (VQA)

Cannot find the paper you are looking for? You can Submit a new open access paper.