no code implementations • LREC 2020 • Filip Klubi{\v{c}}ka, Alfredo Maldonado, Abhijit Mahalunkar, John Kelleher
Our WordNet taxonomic random walk implementation allows the exploration of different random walk hyperparameters and the generation of a variety of different pseudo-corpora.
no code implementations • SEMEVAL 2018 • Alfredo Maldonado, Filip Klubi{\v{c}}ka
This paper describes a simple but competitive unsupervised system for hypernym discovery.
Ranked #6 on Hypernym Discovery on Music domain
no code implementations • WS 2016 • Maja Popovi{\'c}, Mihael Ar{\v{c}}an, Filip Klubi{\v{c}}ka
This work explores the obstacles for machine translation systems between closely related South Slavic languages, namely Croatian, Serbian and Slovenian.
no code implementations • LREC 2016 • Nikola Ljube{\v{s}}i{\'c}, Filip Klubi{\v{c}}ka, {\v{Z}}eljko Agi{\'c}, Ivo-Pavao Jazbec
In this paper we present newly developed inflectional lexcions and manually annotated corpora of Croatian and Serbian.
no code implementations • LREC 2016 • Nikola Ljube{\v{s}}i{\'c}, Miquel Espl{\`a}-Gomis, Antonio Toral, Sergio Ortiz Rojas, Filip Klubi{\v{c}}ka
This paper presents an approach for building large monolingual corpora and, at the same time, extracting parallel data by crawling the top-level domain of a given language of interest.
no code implementations • LREC 2014 • Miquel Espl{\`a}-Gomis, Filip Klubi{\v{c}}ka, Nikola Ljube{\v{s}}i{\'c}, Sergio Ortiz-Rojas, Vassilis Papavassiliou, Prokopis Prokopidis
We used both tools for crawling 21 multilingual websites from the tourism domain to build a domain-specific English―Croatian parallel corpus.