TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Visual Reasoning	PHYRE-1B-Cross	Dec[Joint]1f	AUCCESS	40.3	# 2
Visual Reasoning	PHYRE-1B-Within	Dec[Joint]1f	AUCCESS	80.0	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/forward-prediction-for-physical-reasoning/visual-reasoning-on-phyre-1b-cross)](https://paperswithcode.com/sota/visual-reasoning-on-phyre-1b-cross?p=forward-prediction-for-physical-reasoning)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/forward-prediction-for-physical-reasoning/visual-reasoning-on-phyre-1b-within)](https://paperswithcode.com/sota/visual-reasoning-on-phyre-1b-within?p=forward-prediction-for-physical-reasoning)`

Forward Prediction for Physical Reasoning

18 Jun 2020 · Rohit Girdhar, Laura Gustafson, Aaron Adcock, Laurens van der Maaten ·

Physical reasoning requires forward prediction: the ability to forecast what will happen next given some initial world state. We study the performance of state-of-the-art forward-prediction models in the complex physical-reasoning tasks of the PHYRE benchmark. We do so by incorporating models that operate on object or pixel-based representations of the world into simple physical-reasoning agents. We find that forward-prediction models can improve physical-reasoning performance, particularly on complex tasks that involve many objects. However, we also find that these improvements are contingent on the test tasks being small variations of train tasks, and that generalization to completely new task templates is challenging. Surprisingly, we observe that forward predictors with better pixel accuracy do not necessarily lead to better physical-reasoning performance.Nevertheless, our best models set a new state-of-the-art on the PHYRE benchmark.

PDF Abstract