14 Oct 2022 • Iulia-Maria Comsa, Julian Martin Eisenschlos, Srini Narayanan
We propose a benchmark to assess the capability of large language models to reason with conventional metaphors.