PTR is a new large-scale diagnostic visual reasoning dataset for research on part-based conceptual, relational, and physical reasoning. It contains around 70k synthetic RGB-D images with ground-truth object-level and part-level annotations covering semantic instance segmentation, color attributes, spatial and geometric relationships, and certain physical properties such as stability. These images are paired with 700k machine-generated questions covering a variety of reasoning types.