no code implementations • 28 Feb 2023 • Simon Gottschalk, Endri Kacupaj, Sara Abdollahi, Diego Alves, Gabriel Amaral, Elisavet Koutsiana, Tin Kuculo, Daniela Major, Caio Mello, Gullal S. Cheema, Abdul Sittar, Swati, Golsa Tahmasebzadeh, Gaurish Thakkar
Accessing and understanding contemporary and historical events of global impact such as the US elections and the Olympic Games is a major prerequisite for cross-lingual event analytics that investigate event causes, perception and consequences across country borders.
no code implementations • 14 Dec 2022 • Diego Alves, Gaurish Thakkar, Gabriel Amaral, Tin Kuculo, Marko Tadić
With the ever-growing popularity of the field of NLP, the demand for datasets in low resourced-languages follows suit.
no code implementations • 26 Oct 2022 • Gabriel Amaral, Odinaldo Rodrigues, Elena Simperl
Knowledge Graphs are repositories of information that gather data from a multitude of domains and sources in the form of semantic triples, serving as a source of structured data for various crucial applications in the modern web landscape, from Wikipedia infoboxes to search engines.
no code implementations • 17 Jun 2022 • Gabriel Amaral, Mārcis Pinnis, Inguna Skadiņa, Odinaldo Rodrigues, Elena Simperl
However, such labels are not guaranteed to match across languages from an information consistency standpoint, greatly compromising their usefulness for fields such as machine translation.
1 code implementation • 5 May 2022 • Gabriel Amaral, Odinaldo Rodrigues, Elena Simperl
Data verbalisation is a task of great importance in the current field of natural language processing, as there is great benefit in the transformation of our abundant structured and semi-structured data into human-readable formats.
no code implementations • 20 Sep 2021 • Gabriel Amaral, Alessandro Piscopo, Lucie-Aimée Kaffee, Odinaldo Rodrigues, Elena Simperl
Wikidata is one of the most important sources of structured data on the web, built by a worldwide community of volunteers.
no code implementations • 23 Oct 2020 • Diego Alves, Tin Kuculo, Gabriel Amaral, Gaurish Thakkar, Marko Tadic
We introduce the Universal Named-Entity Recognition (UNER)framework, a 4-level classification hierarchy, and the methodology that isbeing adopted to create the first multilingual UNER corpus: the SETimesparallel corpus annotated for named-entities.