Search Results for author: German Rigau

Found 68 papers, 15 papers with code

Overview of the ELE Project

no code implementations EAMT 2022 Itziar Aldabe, Jane Dunne, Aritz Farwell, Owen Gallagher, Federico Gaspari, Maria Giagkou, Jan Hajic, Jens Peter Kückens, Teresa Lynn, Georg Rehm, German Rigau, Katrin Marheinecke, Stelios Piperidis, Natalia Resende, Tea Vojtěchová, Andy Way

This paper provides an overview of the ongoing European Language Equality (ELE) project, an 18-month action funded by the European Commission which involves 52 partners.

The Predicate Matrix and the Event and Implied Situation Ontology: Making More of Events

no code implementations GWC 2016 Roxane Segers, Egoitz Laparra, Marco Rospocher, Piek Vossen, German Rigau, Filip Ilievski

This paper presents the Event and Implied Situation Ontology (ESO), a resource which formalizes the pre and post situations of events and the roles of the entities affected by an event.

Ask2Transformers: Zero-Shot Domain labelling with Pretrained Language Models

1 code implementation EACL (GWC) 2021 Oscar Sainz, German Rigau

In this paper we present a system that exploits different pre-trained Language Models for assigning domain labels to WordNet synsets without any kind of supervision.

Domain Labelling

Measuring HLT Research Equality of European Languages

no code implementations TDLE (LREC) 2022 Gorka Artola, German Rigau

Our ultimate goal is to investigate European language equality in HLT research considering the number of papers published on several HLT research venues that mention each language with respect to their estimated number of speakers.

ClinIDMap: Towards a Clinical IDs Mapping for Data Interoperability

no code implementations LREC 2022 Elena Zotova, Montse Cuadros, German Rigau

For instance, spans manually annotated with IDs from UMLS can be annotated with Semantic Types and Groups, and its corresponding SNOMED CT and ICD-10 IDs.

Towards Cross-checking WordNet and SUMO Using Meronymy

no code implementations GWC 2018 Javier Álvez, German Rigau

We describe the practical application of a black-box testing methodology for the validation of the knowledge encoded in WordNet, SUMO and their mapping by using automated theorem provers.

This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models

1 code implementation 24 Oct 2023 Iker García-Ferrero, Begoña Altuna, Javier Álvez, Itziar Gonzalez-Dios, German Rigau

We have used our dataset with the largest available open LLMs in a zero-shot approach to grasp their generalization and inference capability and we have also fine-tuned some of the models to assess whether the understanding of negation can be trained.

Descriptive Negation +2

GoLLIE: Annotation Guidelines improve Zero-Shot Information-Extraction

1 code implementation 5 Oct 2023 Oscar Sainz, Iker García-Ferrero, Rodrigo Agerri, Oier Lopez de Lacalle, German Rigau, Eneko Agirre

In this paper, we propose GoLLIE (Guideline-following Large Language Model for IE), a model able to improve zero-shot results on unseen IE tasks by virtue of being fine-tuned to comply with annotation guidelines.

 Ranked #1 on Zero-shot Named Entity Recognition (NER) on HarveyNER (using extra training data)

Event Argument Extraction Language Modelling +6

T-Projection: High Quality Annotation Projection for Sequence Labeling Tasks

2 code implementations 20 Dec 2022 Iker García-Ferrero, Rodrigo Agerri, German Rigau

In the absence of readily available labeled data for a given sequence labeling task and language, annotation projection has been proposed as one of the possible strategies to automatically generate annotated data.

 Ranked #1 on Cross-Lingual NER on MasakhaNER2.0 (Hausa metric)

Cross-Lingual NER Machine Translation +2

Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings

4 code implementations 23 Oct 2022 Iker García-Ferrero, Rodrigo Agerri, German Rigau

Zero-resource cross-lingual transfer approaches aim to apply supervised models from a source language to unlabelled target languages.

Cross-Lingual NER Machine Translation +1

Multilingual Central Repository: a Cross-lingual Framework for Developing Wordnets

no code implementations 1 Jul 2021 Xavier Gómez Guinovart, Itziar Gonzalez-Dios, Antoni Oliver, German Rigau

Language resources are necessary for language processing, but building them is costly, involves many researchers from different areas and needs constant updating.

Semi-automatic Generation of Multilingual Datasets for Stance Detection in Twitter

no code implementations 28 Jan 2021 Elena Zotova, Rodrigo Agerri, German Rigau

While interactions in social media such as Twitter occur in many natural languages, research on stance detection (the position or attitude expressed with respect to a specific topic) within the Natural Language Processing field has largely been done for English.

Stance Detection

Ask2Transformers: Zero-Shot Domain labelling with Pre-trained Language Models

1 code implementation 7 Jan 2021 Oscar Sainz, German Rigau

In this paper we present a system that exploits different pre-trained Language Models for assigning domain labels to WordNet synsets without any kind of supervision.

Domain Labelling

Towards modelling SUMO attributes through WordNet adjectives: a Case Study on Qualities

no code implementations LREC 2020 Itziar Gonzalez-Dios, Javier Álvez, German Rigau

In this context, we propose a new semi-automatic approach to model the knowledge about properties and attributes in SUMO by exploiting the information encoded in WordNet adjectives and its mapping to SUMO.

Multilingual Stance Detection in Tweets: The Catalonia Independence Corpus

no code implementations LREC 2020 Elena Zotova, Rodrigo Agerri, Manuel Nuñez, German Rigau

The TW-10 referendum Dataset released at IberEval 2018 is a previous effort to provide multilingual stance-annotated data in Catalan and Spanish.

Stance Detection

NUBES: A Corpus of Negation and Uncertainty in Spanish Clinical Texts

1 code implementation LREC 2020 Salvador Lima, Naiara Perez, Montse Cuadros, German Rigau

This paper introduces the first version of the NUBes corpus (Negation and Uncertainty annotations in Biomedical texts in Spanish).

Negation

Multilingual Stance Detection: The Catalonia Independence Corpus

1 code implementation 31 Mar 2020 Elena Zotova, Rodrigo Agerri, Manuel Nuñez, German Rigau

The TW-10 Referendum Dataset released at IberEval 2018 is a previous effort to provide multilingual stance-annotated data in Catalan and Spanish.

Stance Detection

Commonsense Reasoning Using WordNet and SUMO: a Detailed Analysis

no code implementations GWC 2019 Javier Álvez, Itziar Gonzalez-Dios, German Rigau

Our final objective is the extraction of some guidelines towards a better exploitation of this commonsense knowledge framework by the improvement of the included resources.

Language Independent Sequence Labelling for Opinion Target Extraction

no code implementations 28 Jan 2019 Rodrigo Agerri, German Rigau

In this research note we present a language independent system to model Opinion Target Extraction (OTE) as a sequence labelling task.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +1

Applying the Closed World Assumption to SUMO-based FOL Ontologies for Effective Commonsense Reasoning

no code implementations 14 Aug 2018 Javier Álvez, Itziar Gonzalez-Dios, German Rigau

In this paper, we investigate the application of the Closed World Assumption (CWA) to enable a better exploitation of FOL ontologies by using state-of-the-art automated theorem provers.

Translation

Validating WordNet Meronymy Relations using Adimen-SUMO

no code implementations 20 May 2018 Javier Álvez, Itziar Gonzalez-Dios, German Rigau

In this paper, we report on the practical application of a novel approach for validating the knowledge of WordNet using Adimen-SUMO.

Biomedical term normalization of EHRs with UMLS

no code implementations LREC 2018 Naiara Perez, Montse Cuadros, German Rigau

This paper presents a novel prototype for biomedical term normalization of electronic health record excerpts with the Unified Medical Language System (UMLS) Metathesaurus.

Black-box Testing of First-Order Logic Ontologies Using WordNet

no code implementations 29 May 2017 Javier Álvez, Paqui Lucio, German Rigau

Applying different quality criteria, our testing proposal enables a successful evaluation of a) the competency of several translations of SUMO into FOL and b) the performance of various automated ATPs.

Automatic White-Box Testing of First-Order Logic Ontologies

no code implementations 29 May 2017 Javier Álvez, Montserrat Hermo, Paqui Lucio, German Rigau

Our proposal enables the detection of defects and serves to certify the grade of suitability --for reasoning purposes-- of every axiom.

W2VLDA: Almost Unsupervised System for Aspect Based Sentiment Analysis

1 code implementation 22 May 2017 Aitor García-Pablos, Montse Cuadros, German Rigau

With the increase of online customer opinions in specialised websites and social networks, the necessity of automatic systems to help to organise and classify customer reviews by domain-specific aspect/categories and sentiment polarity is more important than ever.

Aspect-Based Sentiment Analysis Aspect-Based Sentiment Analysis (ABSA) +2

Q-WordNet PPV: Simple, Robust and (almost) Unsupervised Generation of Polarity Lexicons for Multiple Languages

no code implementations 6 Feb 2017 Iñaki San Vicente, Rodrigo Agerri, German Rigau

This paper presents a simple, robust and (almost) unsupervised dictionary-based method, qwn-ppv (Q-WordNet as Personalized PageRanking Vector) to automatically generate polarity lexicons.

Sentiment Analysis

Multilingual and Cross-lingual Timeline Extraction

no code implementations 2 Feb 2017 Egoitz Laparra, Rodrigo Agerri, Itziar Aldabe, German Rigau

In this paper we present an approach to extract ordered timelines of events, their participants, locations and times from a set of multilingual and cross-lingual data sources.

The Event and Implied Situation Ontology (ESO): Application and Evaluation

no code implementations LREC 2016 Roxane Segers, Marco Rospocher, Piek Vossen, Egoitz Laparra, German Rigau, Anne-Lyse Minard

This paper presents the Event and Implied Situation Ontology (ESO), a manually constructed resource which formalizes the pre and post situations of events and the roles of the entities affected by an event.

A Multilingual Predicate Matrix

no code implementations LREC 2016 Maddalen Lopez de Lacalle, Egoitz Laparra, Itziar Aldabe, German Rigau

This paper presents the Predicate Matrix 1.3, a lexical resource resulting from the integration of multiple sources of predicate information including FrameNet, VerbNet, PropBank and WordNet.

A Comparison of Domain-based Word Polarity Estimation using different Word Embeddings

no code implementations LREC 2016 Aitor García Pablos, Montse Cuadros, German Rigau

In basic Sentiment Analysis systems this sentiment polarity of the words is accounted and weighted in different ways to provide a degree of positivity/negativity.

Sentiment Analysis Word Embeddings

Addressing the MFS Bias in WSD systems

no code implementations LREC 2016 Marten Postma, Ruben Izquierdo, Eneko Agirre, German Rigau, Piek Vossen

Word Sense Disambiguation (WSD) systems tend to have a strong bias towards assigning the Most Frequent Sense (MFS), which results in high performance on the MFS but in a very low performance on the less frequent senses.

Word Sense Disambiguation

Improving the Competency of First-Order Ontologies

no code implementations 16 Oct 2015 Javier Álvez, Paqui Lucio, German Rigau

We introduce a new framework to evaluate and improve first-order (FO) ontologies using automated theorem provers (ATPs) on the basis of competency questions (CQs).

Evaluating the Competency of a First-Order Ontology

no code implementations 16 Oct 2015 Javier Álvez, Paqui Lucio, German Rigau

We report on the results of evaluating the competency of a first-order ontology for its use with automated theorem provers (ATPs).

NewsReader: recording history from daily news streams

no code implementations LREC 2014 Piek Vossen, German Rigau, Luciano Serafini, Pim Stouten, Francis Irving, Willem van Hage

The European project NewsReader develops technology to process daily news streams in 4 languages, extracting what happened, when, where and who was involved.

IXA pipeline: Efficient and Ready to Use Multilingual NLP tools

no code implementations LREC 2014 Rodrigo Agerri, Josu Bermudez, German Rigau

IXA pipeline is a modular set of Natural Language Processing tools (or pipes) which provide easy access to NLP technology.

Coreference Resolution Multilingual NLP +2

Predicate Matrix: extending SemLink through WordNet mappings

no code implementations LREC 2014 Maddalen Lopez de Lacalle, Egoitz Laparra, German Rigau

This paper presents the Predicate Matrix v1.1, a new lexical resource resulting from the integration of multiple sources of predicate information including FrameNet, VerbNet, PropBank and WordNet.

Natural Language Inference Question Answering +2

Highlighting relevant concepts from Topic Signatures

no code implementations LREC 2012 Montse Cuadros, Lluís Padró, German Rigau

Basically, the method applies a knowledge-based Word Sense Disambiguation algorithm to assign the most appropriate WordNet sense to large sets of topically related words acquired from the web, named TSWEB.

Word Sense Disambiguation

A proposal for improving WordNet Domains

no code implementations LREC 2012 Aitor González-Agirre, Mauro Castillo, German Rigau

Moreover, it is very difficult to quantify the number of errors in the original version of WND.

Multilingual Central Repository version 3.0

no code implementations LREC 2012 Aitor Gonzalez-Agirre, Egoitz Laparra, German Rigau

This paper describes the upgrading process of the Multilingual Central Repository (MCR).
