no code implementations • LREC 2022 • Zdeněk Žabokrtský, Niyati Bafna, Jan Bodnár, Lukáš Kyjánek, Emil Svoboda, Magda Ševčíková, Jonáš Vidra
Our work aims at developing a multilingual data resource for morphological segmentation.
no code implementations • 6 Jun 2019 • Tomáš Musil, Jonáš Vidra, David Mareček
Derivation is a type of a word-formation process which creates new words from existing ones by adding, changing or deleting affixes.
1 code implementation • 14 Jun 2018 • Dominik Macháček, Jonáš Vidra, Ondřej Bojar
The state of the art of handling rich morphology in neural machine translation (NMT) is to break word forms into subword units, so that the overall vocabulary size of these units fits the practical limits given by the NMT model and GPU memory capacity.