no code implementations • 15 Nov 2023 • Serwan Jassim, Mario Holubar, Annika Richter, Cornelius Wolff, Xenia Ohmer, Elia Bruni
Our evaluation reveals significant shortcomings in the language grounding and intuitive physics capabilities of these models.