G-VUE (General-purpose Visual Understanding Evaluation)

Introduced by Huang et al. in Perceive, Ground, Reason, and Act: A Benchmark for General-purpose Visual Representation

General-purpose Visual Understanding Evaluation (G-VUE) is a comprehensive benchmark covering the full spectrum of visual cognitive abilities with four functional domains -- Perceive, Ground, Reason, and Act. The four domains are embodied in 11 carefully curated tasks, from 3D reconstruction to visual reasoning and manipulation.

Source: Perceive, Ground, Reason, and Act: A Benchmark for General-purpose Visual Representation

Papers


Paper Code Results Date Stars

Dataset Loaders


No data loaders found. You can submit your data loader here.

Tasks


Similar Datasets


License


  • Unknown

Modalities


Languages