no code implementations • LREC 2014 • Júlia Pajzs, Ralf Steinberger, Maud Ehrmann, Mohamed Ebrahim, Leonida della Rocca, Stefano Bucci, Eszter Simon, Tamás Váradi
In this paper, we describe the effort of adding Hungarian text mining tools to EMM for news gathering; document categorisation; named entity recognition and classification for persons, organisations and locations; name lemmatisation; quotation recognition; and cross-lingual linking of related news clusters.
no code implementations • LREC 2012 • Ralf Steinberger, Mohamed Ebrahim, Marco Turchi
EuroVoc (2012) is a highly multilingual thesaurus consisting of over 6,700 hierarchically organised subject domains used by European Institutions and many authorities in Member States of the European Union (EU) for the classification and retrieval of official documents.