no code implementations • LREC 2020 • Tam{\'a}s V{\'a}radi, Svetla Koeva, Martin Yamalov, Marko Tadi{\'c}, B{\'a}lint Sass, Bart{\l}omiej Nito{\'n}, Maciej Ogrodniczuk, Piotr P{\k{e}}zik, Verginica Barbu Mititelu, Radu Ion, Elena Irimia, Maria Mitrofan, Vasile P{\u{a}}i{\textcommabelow{s}}, Dan Tufi{\textcommabelow{s}}, Radovan Garab{\'\i}k, Simon Krek, Andraz Repar, Matja{\v{z}} Rihtar, Janez Brank
This article presents the current outcomes of the MARCELL CEF Telecom project aiming to collect and deeply annotate a large comparable corpus of legal documents.
no code implementations • LREC 2020 • Bojana Mikeleni{\'c}, Marko Tadi{\'c}
This limits the usability of the corpus for research of language units at sentence and lower language levels only.
no code implementations • LREC 2016 • Ines Cebovi{\'c}, Marko Tadi{\'c}
The mk-hr{\_}pcorp is a unidirectional (mk→hr) parallel corpus composed of synchronic fictional prose texts received already in digital form with over 500 Kw in each language.
no code implementations • LREC 2014 • Georg Rehm, Hans Uszkoreit, Sophia Ananiadou, N{\'u}ria Bel, Audron{\.e} Bielevi{\v{c}}ien{\.e}, Lars Borin, Ant{\'o}nio Branco, Gerhard Budin, Nicoletta Calzolari, Walter Daelemans, Radovan Garab{\'\i}k, Marko Grobelnik, Carmen Garc{\'\i}a-Mateo, Josef van Genabith, Jan Haji{\v{c}}, Inma Hern{\'a}ez, John Judge, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Lind{\'e}n, Bernardo Magnini, Joseph Mariani, John McNaught, Maite Melero, Monica Monachini, Asunci{\'o}n Moreno, Jan Odijk, Maciej Ogrodniczuk, Piotr P{\k{e}}zik, Stelios Piperidis, Adam Przepi{\'o}rkowski, Eir{\'\i}kur R{\"o}gnvaldsson, Michael Rosner, Bolette Pedersen, Inguna Skadi{\c{n}}a, Koenraad De Smedt, Marko Tadi{\'c}, Paul Thompson, Dan Tufi{\c{s}}, Tam{\'a}s V{\'a}radi, Andrejs Vasi{\c{l}}jevs, Kadri Vider, Jolanta Zabarskaite
This article provides an overview of the dissemination work carried out in META-NET from 2010 until early 2014; we describe its impact on the regional, national and international level, mainly with regard to politics and the situation of funding for LT topics.
no code implementations • LREC 2014 • {\v{Z}}eljko Agi{\'c}, Da{\v{s}}a Berovi{\'c}, Danijela Merkler, Marko Tadi{\'c}
Introducing the new annotation to Croatian Dependency Treebank, we also modify head attachment rules addressing subordinate conjunctions and subordinate clause predicates.
no code implementations • LREC 2014 • Llu{\'\i}s Padr{\'o}, {\v{Z}}eljko Agi{\'c}, Xavier Carreras, Blaz Fortuna, Esteban Garc{\'\i}a-Cuesta, Zhixing Li, Tadej {\v{S}}tajner, Marko Tadi{\'c}
This paper presents the linguistic analysis tools and its infrastructure developed within the XLike project.
no code implementations • LREC 2014 • Achim Rettinger, Lei Zhang, Da{\v{s}}a Berovi{\'c}, Danijela Merkler, Matea Sreba{\v{c}}i{\'c}, Marko Tadi{\'c}
To support this line of research we developed what we believe could serve as a gold standard Resource for Evaluating Cross-lingual Semantic Annotation (RECSA).
no code implementations • LREC 2014 • Kre{\v{s}}imir {\v{S}}ojat, Matea Sreba{\v{c}}i{\'c}, Marko Tadi{\'c}, Tin Paveli{\'c}
Language data structured in this way was further used for the expansion of other language resources for Croatian, such as Croatian WordNet and the Croatian Morphological Lexicon.
no code implementations • LREC 2012 • Kre{\v{s}}imir {\v{S}}ojat, Nives Mikeli{\'c} Preradovi{\'c}, Marko Tadi{\'c}
The paper presents a procedure for generating prefixed verbs in Croatian comprising combinations of one, two or three prefixes.
no code implementations • LREC 2012 • Da{\v{s}}a Berovi{\'c}, {\v{Z}}eljko Agi{\'c}, Marko Tadi{\'c}
We present the current state of development of the Croatian Dependency Treebank ― with special empahsis on adapting the Prague Dependency Treebank formalism to Croatian language specifics ― and illustrate its possible applications in an experiment with dependency parsing using MaltParser.