A Finite-state Morphological Analyser for Tuvan
{\textasciitilde}This paper describes the development of free/open-source finite-state morphological transducers for Tuvan, a Turkic language spoken in and around the Tuvan Republic in Russia. The finite-state toolkit used for the work is the Helsinki Finite-State Toolkit (HFST), we use the lexc formalism for modelling the morphotactics and twol formalism for modelling morphophonological alternations. We present a novel description of the morphological combinatorics of pseudo-derivational morphemes in Tuvan. An evaluation is presented which shows that the transducer has a reasonable coverage―around 93{\%}―on freely-available corpora of the languages, and high precision―over 99{\%}―on a manually verified test set.
PDF Abstract LREC 2016 PDF LREC 2016 Abstract