Dataset of the Beacon3D benchmark: Unveiling the Mist over 3D Vision-Language Understanding: Object-centric Evaluation with Chain-of-Analysis.
This dataset includes test data for 3D grounding and QA tasks on ScanNet, 3RScan, and MultiScan. The dataset aims at providing a trustworthy testbed for 3D vision-language models.
Paper | Code | Results | Date | Stars |
---|