no code implementations • 25 Oct 2023 • Alon Goldstein, Miriam Havin, Roi Reichart, Ariel Goldstein
This paper investigates the problem-solving capabilities of Large Language Models (LLMs) by evaluating their performance on stumpers, unique single-step intuition problems that pose challenges for human solvers but are easily verifiable.