1 code implementation • LREC 2020 • Sina Ahmadi, John Philip McCrae, Sanni Nimb, Fahad Khan, Monica Monachini, Bolette Pedersen, Thierry Declerck, Tanja Wissik, Andrea Bellandi, Irene Pisani, Thomas Troelsgård, Sussi Olsen, Simon Krek, Veronika Lipp, Tamás Váradi, László Simon, András Gyorffy, Carole Tiberius, Tanneke Schoonheim, Yifat Ben Moshe, Maya Rudich, Raya Abu Ahmad, Dorielle Lonke, Kira Kovalenko, Margit Langemets, Jelena Kallas, Oksana Dereza, Theodorus Fransen, David Cillessen, David Lindemann, Mikel Alonso, Ana Salgado, José Luis Sancho, Rafael-J. Ureña-Ruiz, Jordi Porta Zamorano, Kiril Simov, Petya Osenova, Zara Kancheva, Ivaylo Radev, Ranka Stanković, Andrej Perdih, Dejan Gabrovsek
Aligning senses across resources and languages is a challenging task with beneficial applications in the field of natural language processing and electronic lexicography.
no code implementations • WS 2016 • László Hunyadi, Tamás Váradi, István Szekrényes
In this paper we describe how the complexity of human communication can be analysed with the help of language technology.
no code implementations • LREC 2014 • Dagmar Jung, Katarzyna Klessa, Zsuzsa Duray, Beatrix Oszkó, Mária Sipos, Sándor Szeverényi, Zsuzsa Várnai, Paul Trilsbeek, Tamás Váradi
The present paper describes the development of the languagesindanger.eu interactive website as an example of including multimedia language resources to disseminate knowledge and create educational material on less-resourced languages.
no code implementations • LREC 2014 • Georg Rehm, Hans Uszkoreit, Sophia Ananiadou, Núria Bel, Audronė Bielevičienė, Lars Borin, António Branco, Gerhard Budin, Nicoletta Calzolari, Walter Daelemans, Radovan Garabík, Marko Grobelnik, Carmen García-Mateo, Josef van Genabith, Jan Hajič, Inma Hernáez, John Judge, Svetla Koeva, Simon Krek, Cvetana Krstev, Krister Lindén, Bernardo Magnini, Joseph Mariani, John McNaught, Maite Melero, Monica Monachini, Asunción Moreno, Jan Odijk, Maciej Ogrodniczuk, Piotr Pęzik, Stelios Piperidis, Adam Przepiórkowski, Eiríkur Rögnvaldsson, Michael Rosner, Bolette Pedersen, Inguna Skadiņa, Koenraad De Smedt, Marko Tadić, Paul Thompson, Dan Tufiş, Tamás Váradi, Andrejs Vasiļjevs, Kadri Vider, Jolanta Zabarskaite
This article provides an overview of the dissemination work carried out in META-NET from 2010 until early 2014; we describe its impact on the regional, national and international level, mainly with regard to politics and the situation of funding for LT topics.
no code implementations • LREC 2014 • Júlia Pajzs, Ralf Steinberger, Maud Ehrmann, Mohamed Ebrahim, Leonida della Rocca, Stefano Bucci, Eszter Simon, Tamás Váradi
In this paper, we describe the effort of adding to EMM Hungarian text mining tools for news gathering; document categorisation; named entity recognition and classification for persons, organisations and locations; name lemmatisation; quotation recognition; and cross-lingual linking of related news clusters.
no code implementations • LREC 2014 • Csaba Oravecz, Tamás Váradi, Bálint Sass
The paper reports on the development of the Hungarian Gigaword Corpus (HGC), an extended new edition of the Hungarian National Corpus, with upgraded and redesigned linguistic annotation and an increased size of 1.5 billion tokens.
no code implementations • LREC 2020 • Tamás Váradi, Svetla Koeva, Martin Yamalov, Marko Tadić, Bálint Sass, Bartłomiej Nitoń, Maciej Ogrodniczuk, Piotr Pęzik, Verginica Barbu Mititelu, Radu Ion, Elena Irimia, Maria Mitrofan, Vasile Păiș, Dan Tufiș, Radovan Garabík, Simon Krek, Andraz Repar, Matjaž Rihtar, Janez Brank
This article presents the current outcomes of the MARCELL CEF Telecom project aiming to collect and deeply annotate a large comparable corpus of legal documents.