Hi-Phy is a benchmark for physical reasoning that allows researchers to test individual physical reasoning capabilities. Inspired by how humans acquire these capabilities, the benchmark proposes a general hierarchy of physical reasoning capabilities with increasing complexity. this benchmark tests capabilities according to this hierarchy through generated physical reasoning tasks in the video game Angry Birds.
Paper | Code | Results | Date | Stars |
---|