Search Results for author: Feiyang Lv

Found 2 papers, 2 papers with code

ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding

1 code implementation • 5 Aug 2022 • Bingning Wang, Feiyang Lv, Ting Yao, Yiming Yuan, Jin Ma, Yu Luo, Haijin Liang

However, in most public visual question answering datasets, such as VQA and CLEVR, the questions are human-generated and specific to the given image, for example "What color are her eyes?"

Image Retrieval, Question Answering (+2 more)
