no code implementations • EAMT 2022 • Itziar Aldabe, Jane Dunne, Aritz Farwell, Owen Gallagher, Federico Gaspari, Maria Giagkou, Jan Hajic, Jens Peter Kückens, Teresa Lynn, Georg Rehm, German Rigau, Katrin Marheinecke, Stelios Piperidis, Natalia Resende, Tea Vojtěchová, Andy Way
This paper provides an overview of the ongoing European Language Equality(ELE) project, an 18-month action funded by the European Commission which involves 52 partners.
no code implementations • TDLE (LREC) 2022 • Gorka Artola, German Rigau
Our ultimate goal is to investigate European language equality in HLT research considering the number of papers published on several HLT research venues that mention each language with respect to their estimated number of speakers.
no code implementations • GWC 2016 • Roxane Segers, Egoitz Laparra, Marco Rospocher, Piek Vossen, German Rigau, Filip Ilievski
This paper presents the Event and Implied Situation Ontology (ESO), a resource which formalizes the pre and post situations of events and the roles of the entities affected by an event.
no code implementations • GWC 2018 • Javier Álvez, German Rigau
We describe the practical application of a black-box testing methodology for the validation of the knowledge encoded in WordNet, SUMO and their mapping by using automated theorem provers.
1 code implementation • EACL (GWC) 2021 • Oscar Sainz, German Rigau
In this paper we present a system that exploits different pre-trained Language Models for assigning domain labels to WordNet synsets without any kind of supervision.
1 code implementation • LREC 2022 • Elena Zotova, Montse Cuadros, German Rigau
For instance, spans manually annotated with IDs from UMLS can be annotated with Semantic Types and Groups, and its corresponding SNOMED CT and ICD-10 IDs.
1 code implementation • Findings (EMNLP) 2021 • Iker García-Ferrero, Rodrigo Agerri, German Rigau
In the last few years, several methods have been proposed to build meta-embeddings.
no code implementations • 11 Apr 2024 • Iker García-Ferrero, Rodrigo Agerri, Aitziber Atutxa Salazar, Elena Cabrio, Iker de la Iglesia, Alberto Lavelli, Bernardo Magnini, Benjamin Molinet, Johana Ramirez-Romero, German Rigau, Jose Maria Villa-Gonzalez, Serena Villata, Andrea Zaninello
While these LLMs display competitive performance on automated medical texts benchmarks, they have been pre-trained and evaluated with a focus on a single language (English mostly).
1 code implementation • 29 Mar 2024 • Julen Etxaniz, Oscar Sainz, Naiara Perez, Itziar Aldabe, German Rigau, Eneko Agirre, Aitor Ormazabal, Mikel Artetxe, Aitor Soroa
We introduce Latxa, a family of large language models for Basque ranging from 7 to 70 billion parameters.
1 code implementation • 24 Oct 2023 • Iker García-Ferrero, Begoña Altuna, Javier Álvez, Itziar Gonzalez-Dios, German Rigau
We have used our dataset with the largest available open LLMs in a zero-shot approach to grasp their generalization and inference capability and we have also fine-tuned some of the models to assess whether the understanding of negation can be trained.
1 code implementation • 5 Oct 2023 • Oscar Sainz, Iker García-Ferrero, Rodrigo Agerri, Oier Lopez de Lacalle, German Rigau, Eneko Agirre
In this paper, we propose GoLLIE (Guideline-following Large Language Model for IE), a model able to improve zero-shot results on unseen IE tasks by virtue of being fine-tuned to comply with annotation guidelines.
Ranked #1 on
Zero-shot Named Entity Recognition (NER)
on HarveyNER
(using extra training data)
1 code implementation • 9 Jun 2023 • Rodrigo Agerri, Iñigo Alonso, Aitziber Atutxa, Ander Berrondo, Ainara Estarrona, Iker Garcia-Ferrero, Iakes Goenaga, Koldo Gojenola, Maite Oronoz, Igor Perez-Tejedor, German Rigau, Anar Yeginbergenova
Providing high quality explanations for AI predictions based on machine learning is a challenging and complex task.
1 code implementation • 27 Apr 2023 • Nayla Escribano, German Rigau, Rodrigo Agerri
Detecting and normalizing temporal expressions is an essential step for many NLP tasks.
no code implementations • 7 Feb 2023 • Oscar Sainz, Oier Lopez de Lacalle, Eneko Agirre, German Rigau
Language Models are the core for almost any Natural Language Processing system nowadays.
2 code implementations • 20 Dec 2022 • Iker García-Ferrero, Rodrigo Agerri, German Rigau
In the absence of readily available labeled data for a given sequence labeling task and language, annotation projection has been proposed as one of the possible strategies to automatically generate annotated data.
Ranked #1 on
Cross-Lingual NER
on MasakhaNER2.0
(Hausa metric)
4 code implementations • 23 Oct 2022 • Iker García-Ferrero, Rodrigo Agerri, German Rigau
Zero-resource cross-lingual transfer approaches aim to apply supervised models from a source language to unlabelled target languages.
Ranked #1 on
Cross-Lingual NER
on CoNLL Spanish
no code implementations • 1 Jul 2021 • Xavier Gómez Guinovart, Itziar Gonzalez-Dios, Antoni Oliver, German Rigau
Language resources are necessary for language processing, but building them is costly, involves many researches from different areas and needs constant updating.
no code implementations • 28 Jan 2021 • Elena Zotova, Rodrigo Agerri, German Rigau
While interactions in social media such as Twitter occur in many natural languages, research on stance detection (the position or attitude expressed with respect to a specific topic) within the Natural Language Processing field has largely been done for English.
1 code implementation • 7 Jan 2021 • Oscar Sainz, German Rigau
In this paper we present a system that exploits different pre-trained Language Models for assigning domain labels to WordNet synsets without any kind of supervision.
Ranked #1 on
Domain Labelling
on BabelDomains
no code implementations • LREC 2020 • Elena Zotova, Rodrigo Agerri, Manuel Nu{\~n}ez, German Rigau
The TW-10 referendum Dataset released at IberEval 2018 is a previous effort to provide multilingual stance-annotated data in Catalan and Spanish.
no code implementations • LREC 2020 • Itziar Gonzalez-Dios, Javier Alvez, German Rigau
In this context, we propose a new semi-automatic approach to model the knowledge about properties and attributes in SUMO by exploiting the information encoded in WordNet adjectives and its mapping to SUMO.
1 code implementation • LREC 2020 • Salvador Lima, Naiara Perez, Montse Cuadros, German Rigau
This paper introduces the first version of the NUBes corpus (Negation and Uncertainty annotations in Biomedical texts in Spanish).
1 code implementation • 31 Mar 2020 • Elena Zotova, Rodrigo Agerri, Manuel Nuñez, German Rigau
The TW-10 Referendum Dataset released at IberEval 2018 is a previous effort to provide multilingual stance-annotated data in Catalan and Spanish.
2 code implementations • 17 Jan 2020 • Iker García-Ferrero, Rodrigo Agerri, German Rigau
This paper presents a new technique for creating monolingual and cross-lingual meta-embeddings.
no code implementations • GWC 2019 • Javier Álvez, Itziar Gonzalez-Dios, German Rigau
Our final objective is the extraction of some guidelines towards a better exploitation of this commonsense knowledge framework by the improvement of the included resources.
no code implementations • 28 Jan 2019 • Rodrigo Agerri, German Rigau
In this research note we present a language independent system to model Opinion Target Extraction (OTE) as a sequence labelling task.
Aspect-Based Sentiment Analysis
Aspect-Based Sentiment Analysis (ABSA)
+1
no code implementations • 14 Aug 2018 • Javier Álvez, Itziar Gonzalez-Dios, German Rigau
In this paper, we investigate the application of the Closed World Assumption (CWA) to enable a better exploitation of FOL ontologies by using state-of-the-art automated theorem provers.
no code implementations • 20 May 2018 • Javier Álvez, Itziar Gonzalez-Dios, German Rigau
In this paper, we report on the practical application of a novel approach for validating the knowledge of WordNet using Adimen-SUMO.
no code implementations • LREC 2018 • Naiara Perez, Montse Cuadros, German Rigau
This paper presents a novel prototype for biomedical term normalization of electronic health record excerpts with the Unified Medical Language System (UMLS) Metathesaurus.
no code implementations • 29 May 2017 • Javier Álvez, Montserrat Hermo, Paqui Lucio, German Rigau
Our proposal enables the detection of defects and serves to certify the grade of suitability --for reasoning purposes-- of every axiom.
no code implementations • 29 May 2017 • Javier Álvez, Paqui Lucio, German Rigau
Applying different quality criteria, our testing proposal enables a successful evaluation of a) the competency of several translations of SUMO into FOL and b) the performance of various automated ATPs.
1 code implementation • 22 May 2017 • Aitor García-Pablos, Montse Cuadros, German Rigau
With the increase of online customer opinions in specialised websites and social networks, the necessity of automatic systems to help to organise and classify customer reviews by domain-specific aspect/categories and sentiment polarity is more important than ever.
Aspect-Based Sentiment Analysis
Aspect-Based Sentiment Analysis (ABSA)
+2
no code implementations • 6 Feb 2017 • Iñaki San Vicente, Rodrigo Agerri, German Rigau
This paper presents a simple, robust and (almost) unsupervised dictionary-based method, qwn-ppv (Q-WordNet as Personalized PageRanking Vector) to automatically generate polarity lexicons.
no code implementations • 2 Feb 2017 • Egoitz Laparra, Rodrigo Agerri, Itziar Aldabe, German Rigau
In this paper we present an approach to extract ordered timelines of events, their participants, locations and times from a set of multilingual and cross-lingual data sources.
1 code implementation • 31 Jan 2017 • Rodrigo Agerri, German Rigau
Finally, the results show that our emphasis on clustering features is crucial to develop robust out-of-domain models.
Ranked #63 on
Named Entity Recognition (NER)
on CoNLL 2003 (English)
no code implementations • LREC 2016 • Maddalen Lopez de Lacalle, Egoitz Laparra, Itziar Aldabe, German Rigau
This paper presents the Predicate Matrix 1. 3, a lexical resource resulting from the integration of multiple sources of predicate information including FrameNet, VerbNet, PropBank and WordNet.
no code implementations • LREC 2016 • Aitor Garc{\'\i}a Pablos, Montse Cuadros, German Rigau
In basic Sentiment Analysis systems this sentiment polarity of the words is accounted and weighted in different ways to provide a degree of positivity/negativity.
no code implementations • LREC 2016 • Roxane Segers, Marco Rospocher, Piek Vossen, Egoitz Laparra, German Rigau, Anne-Lyse Minard
This paper presents the Event and Implied Situation Ontology (ESO), a manually constructed resource which formalizes the pre and post situations of events and the roles of the entities affected by an event.
no code implementations • LREC 2016 • Marten Postma, Ruben Izquierdo, Eneko Agirre, German Rigau, Piek Vossen
Word Sense Disambiguation (WSD) systems tend to have a strong bias towards assigning the Most Frequent Sense (MFS), which results in high performance on the MFS but in a very low performance on the less frequent senses.
no code implementations • 16 Oct 2015 • Javier Álvez, Paqui Lucio, German Rigau
We report on the results of evaluating the competency of a first-order ontology for its use with automated theorem provers (ATPs).
no code implementations • 16 Oct 2015 • Javier Álvez, Paqui Lucio, German Rigau
We introduce a new framework to evaluate and improve first-order (FO) ontologies using automated theorem provers (ATPs) on the basis of competency questions (CQs).
no code implementations • SEMEVAL 2015 • Eneko Agirre, Carmen Banea, Claire Cardie, Daniel Cer, Mona Diab, Aitor Gonzalez-Agirre, Weiwei Guo, I{\~n}igo Lopez-Gazpio, Montse Maritxalar, Rada Mihalcea, German Rigau, Larraitz Uria, Janyce Wiebe
no code implementations • LREC 2014 • Piek Vossen, German Rigau, Luciano Serafini, Pim Stouten, Francis Irving, Willem van Hage
The European project NewsReader develops technology to process daily news streams in 4 languages, extracting what happened, when, where and who was involved.
no code implementations • LREC 2014 • Maddalen Lopez de Lacalle, Egoitz Laparra, German Rigau
This paper presents the Predicate Matrix v1. 1, a new lexical resource resulting from the integration of multiple sources of predicate information including FrameNet, VerbNet, PropBank and WordNet.
no code implementations • LREC 2014 • Rodrigo Agerri, Josu Bermudez, German Rigau
IXA pipeline is a modular set of Natural Language Processing tools (or pipes) which provide easy access to NLP technology.
no code implementations • LREC 2012 • Aitor Gonz{\'a}lez-Agirre, Mauro Castillo, German Rigau
Moreover, it is very difficult to quantify the number of errors in the original version of WND.
no code implementations • LREC 2012 • Egoitz Laparra, German Rigau, Piek Vossen
This paper describes the connection of WordNet to a generic ontology based on DOLCE.
no code implementations • LREC 2012 • Montse Cuadros, Llu{\'\i}s Padr{\'o}, German Rigau
Basically, the method applies a knowledge-based Word Sense Disambiguation algorithm to assign the most appropriate WordNet sense to large sets of topically related words acquired from the web, named TSWEB.
no code implementations • LREC 2012 • Aitor Gonzalez-Agirre, Egoitz Laparra, German Rigau
This paper describes the upgrading process of the Multilingual Central Repository (MCR).