no code implementations • 30 Jan 2023 • Maksud Sharipov, Elmurod Kuriyozov, Ollabergan Yuldashev, Ogabek Sobirov
This research paper presents a part-of-speech (POS) annotated dataset and tagger tool for the low-resource Uzbek language.
no code implementations • 28 Oct 2022 • Maksud Sharipov, Ollabergan Yuldashov
In this paper we present a rule-based stemming algorithm for the Uzbek language.
no code implementations • 28 Oct 2022 • Maksud Sharipov, Ogabek Sobirov
This lemmatization consists of the general rules and a part of speech data of the Uzbek language, affixes, classification of affixes, removing affixes on the basis of the finite state machine for each class, as well as a definition of this word lemma.
no code implementations • 27 Oct 2022 • Maksud Sharipov, Jamolbek Mattiev, Jasur Sobirov, Rustam Baltayev
Nowadays, creation of the tagged corpora is becoming one of the most important tasks of Natural Language Processing (NLP).
no code implementations • 20 May 2022 • Maksud Sharipov, Ulugbek Salaev
This work presents a morphological analyzer for the Uzbek language using a finite state machine.