no code implementations • LREC 2020 • Dan Tufi{\textcommabelow{s}}, Maria Mitrofan, Vasile P{\u{a}}i{\textcommabelow{s}}, Radu Ion, Andrei Coman
We present the Romanian legislative corpus which is a valuable linguistic asset for the development of machine translation systems, especially for under-resourced languages.
no code implementations • LREC 2020 • Vasile P{\u{a}}i{\textcommabelow{s}}, Radu Ion, Dan Tufi{\textcommabelow{s}}
This paper presents RELATE (http://relate. racai. ro), a high-performance natural language platform designed for Romanian language.
no code implementations • LREC 2020 • Tam{\'a}s V{\'a}radi, Svetla Koeva, Martin Yamalov, Marko Tadi{\'c}, B{\'a}lint Sass, Bart{\l}omiej Nito{\'n}, Maciej Ogrodniczuk, Piotr P{\k{e}}zik, Verginica Barbu Mititelu, Radu Ion, Elena Irimia, Maria Mitrofan, Vasile P{\u{a}}i{\textcommabelow{s}}, Dan Tufi{\textcommabelow{s}}, Radovan Garab{\'\i}k, Simon Krek, Andraz Repar, Matja{\v{z}} Rihtar, Janez Brank
This article presents the current outcomes of the MARCELL CEF Telecom project aiming to collect and deeply annotate a large comparable corpus of legal documents.
no code implementations • LREC 2020 • Vasile Pais, Dan Tufi{\textcommabelow{s}}, Radu Ion
This paper describes RACAI{'}s word sense alignment system, which participated in the Monolingual Word Sense Alignment shared task organized at GlobaLex 2020 workshop.
no code implementations • LREC 2016 • Dan Tufi{\textcommabelow{s}}, Verginica Barbu Mititelu, Elena Irimia, {\textcommabelow{S}}tefan Daniel Dumitrescu, Tiberiu Boro{\textcommabelow{s}}
The article describes the current status of a large national project, CoRoLa, aiming at building a reference corpus for the contemporary Romanian language.
no code implementations • LREC 2014 • Verginica Barbu Mititelu, Elena Irimia, Dan Tufi{\textcommabelow{s}}
Our project is a joined effort of two institutes of the Romanian Academy.
no code implementations • LREC 2012 • Radu Ion, Elena Irimia, Dan {\c{S}}tef{\u{a}}nescu, Dan Tufi{\textcommabelow{s}}
This article describes the collecting, processing and validation of a large balanced corpus for Romanian.