Search Results for author: Sirui Cheng

Found 3 papers, 1 papers with code

KNVQA: A Benchmark for evaluation knowledge-based VQA

no code implementations21 Nov 2023 Sirui Cheng, Siyu Zhang, Jiayi Wu, Muchen Lan

Within the multimodal field, large vision-language models (LVLMs) have made significant progress due to their strong perception and reasoning capabilities in the visual and language systems.

Hallucination Visual Question Answering (VQA)

Multiscale Superpixel Structured Difference Graph Convolutional Network for VL Representation

no code implementations20 Oct 2023 Siyu Zhang, Yeming Chen, Sirui Cheng, Yaoru Sun, Jun Yang, Lizhi Bai

It parses the entire image as a fine-to-coarse hierarchical structure of constituent visual patterns, and captures multiscale features by progressively merging adjacent superpixels as graph nodes.

Self-Supervised Learning Superpixels +1

Evaluating Open-QA Evaluation

1 code implementation NeurIPS 2023 Cunxiang Wang, Sirui Cheng, Qipeng Guo, Yuanhao Yue, Bowen Ding, Zhikun Xu, Yidong Wang, Xiangkun Hu, Zheng Zhang, Yue Zhang

This study focuses on the evaluation of the Open Question Answering (Open-QA) task, which can directly estimate the factuality of large language models (LLMs).

Question Answering

Cannot find the paper you are looking for? You can Submit a new open access paper.