The MARCELL Legislative Corpus

LREC 2020 · Tamás Váradi, Svetla Koeva, Martin Yamalov, Marko Tadić, Bálint Sass, Bartłomiej Nitoń, Maciej Ogrodniczuk, Piotr Pęzik, Verginica Barbu Mititelu, Radu Ion, Elena Irimia, Maria Mitrofan, Vasile Păiș, Dan Tufiș, Radovan Garabík, Simon Krek, Andraz Repar, Matjaž Rihtar, Janez Brank

This article presents the current outcomes of the MARCELL CEF Telecom project, which aims to collect and deeply annotate a large comparable corpus of legal documents. The MARCELL corpus includes 7 monolingual sub-corpora (Bulgarian, Croatian, Hungarian, Polish, Romanian, Slovak and Slovenian) containing the total body of the respective national legislative documents...
