Search Results for author: Jialou Wang

Found 1 papers, 0 papers with code

Detect2Interact: Localizing Object Key Field in Visual Question Answering (VQA) with LLMs

no code implementations1 Apr 2024 Jialou Wang, Manli Zhu, Yulei Li, Honglei Li, Longzhi Yang, Wai Lok Woo

As a result, Detect2Interact achieves consistent qualitative results on object key field detection across extensive test cases and outperforms the existing VQA system with object detection by providing a more reasonable and finer visual representation.

Common Sense Reasoning Object +4

Cannot find the paper you are looking for? You can Submit a new open access paper.