Visual Commonsense Tests
2 papers with code • 1 benchmarks • 1 datasets
Predict 5 property types (color, shape, material, size, and visual co-occurrence) for over 5000 subjects.
Most implemented papers
Visual Commonsense in Pretrained Unimodal and Multimodal Models
Our commonsense knowledge about objects includes their typical visual attributes; we know that bananas are typically yellow or green, and not purple.
Z-LaVI: Zero-Shot Language Solver Fueled by Visual Imagination
Large-scale pretrained language models have made significant advances in solving downstream language understanding tasks.