Search Results for author: Kerui Zhang

Found 2 papers, 2 papers with code

Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning

1 code implementation • 12 Jul 2023 • Gengyuan Zhang, Yurui Zhang, Kerui Zhang, Volker Tresp

This makes us wonder if, based on visual cues, Vision-Language Models that are pre-trained with large-scale image-text resources can achieve and even outperform humans' capability in reasoning about times and location.
