no code implementations • 16 Aug 2024 • Gonzalo Martínez, Juan Diego Molero, Sandra González, Javier Conde, Marc Brysbaert, Pedro Reviriego
In Study 1, ChatGPT-4o showed strong correlations with human concreteness ratings (r = .8) for multi-word expressions.
1 code implementation • 23 Oct 2023 • Gonzalo Martínez, Javier Conde, Elena Merino-Gómez, Beatriz Bermúdez-Margaretto, José Alberto Hernández, Pedro Reviriego, Marc Brysbaert
Vocabulary tests, once a cornerstone of language modeling evaluation, have been largely overlooked in the current landscape of Large Language Models (LLMs) like Llama, Mistral, and GPT.