1 code implementation • 9 Jun 2023 • Judit Acs, Endre Hamerlik, Roy Schwartz, Noah A. Smith, Andras Kornai
We introduce an extensive dataset for multilingual probing of morphological information in language models (247 tasks across 42 languages from 10 families), each consisting of a sentence with a target word and a morphological tag as the desired label, derived from the Universal Dependencies treebanks.
no code implementations • 14 Sep 2021 • Gabor Szolnok, Botond Barta, Dorina Lakatos, Judit Acs
We present the BME submission for the SIGMORPHON 2021 Task 0 Part 1, Generalization Across Typologically Diverse Languages shared task.
1 code implementation • 8 Dec 2020 • Judit Acs, Andras Kornai
By training the models on the same source but different target, we can compare what subwords are important for different tasks and how they relate to each other.