1 code implementation • LREC 2018 • Lukas Svoboda, Slobodan Beliga
We created a new word analogy corpus based on the original English Word2vec word analogy corpus and added some of the specific linguistic aspects from Croatian language.
no code implementations • 27 Mar 2015 • Slobodan Beliga, Miran Pobar, Sanda Martinčić-Ipšić
This paper presents text normalization which is an integral part of any text-to-speech synthesis system.
no code implementations • 28 Aug 2014 • Slobodan Beliga, Sanda Martinčić-Ipšić
This paper presents categorization of Croatian texts using Non-Standard Words (NSW) as features.
no code implementations • 17 Jul 2014 • Slobodan Beliga, Ana Meštrović, Sanda Martinčić-Ipšić
Obtained sets are evaluated on a manually annotated keywords: for the set of extracted keyword candidates average F1 score is 24, 63%, and average F2 score is 21, 19%; for the exacted words-tuples candidates average F1 score is 25, 9% and average F2 score is 24, 47%.