1 code implementation • Findings (EMNLP) 2021 • Iker García-Ferrero, Rodrigo Agerri, German Rigau
In the last few years, several methods have been proposed to build meta-embeddings.
no code implementations • 31 Jul 2024 • Oscar Sainz, Iker García-Ferrero, Alon Jacovi, Jon Ander Campos, Yanai Elazar, Eneko Agirre, Yoav Goldberg, Wei-Lin Chen, Jenny Chim, Leshem Choshen, Luca D'Amico-Wong, Melissa Dell, Run-Ze Fan, Shahriar Golchin, Yucheng Li, PengFei Liu, Bhavish Pahwa, Ameya Prabhu, Suryansh Sharma, Emily Silcock, Kateryna Solonko, David Stap, Mihai Surdeanu, Yu-Min Tseng, Vishaal Udandarao, Zengzhi Wang, Ruijie Xu, Jinglin Yang
The workshop fostered a shared task to collect evidence on data contamination in current available datasets and models.
no code implementations • 11 Apr 2024 • Iker García-Ferrero, Rodrigo Agerri, Aitziber Atutxa Salazar, Elena Cabrio, Iker de la Iglesia, Alberto Lavelli, Bernardo Magnini, Benjamin Molinet, Johana Ramirez-Romero, German Rigau, Jose Maria Villa-Gonzalez, Serena Villata, Andrea Zaninello
While these LLMs display competitive performance on automated medical texts benchmarks, they have been pre-trained and evaluated with a focus on a single language (English mostly).
1 code implementation • 11 Apr 2024 • Iker García-Ferrero, Begoña Altuna
We present NoticIA, a dataset consisting of 850 Spanish news articles featuring prominent clickbait headlines, each paired with high-quality, single-sentence generative summarizations written by humans.
1 code implementation • 27 Oct 2023 • Oscar Sainz, Jon Ander Campos, Iker García-Ferrero, Julen Etxaniz, Oier Lopez de Lacalle, Eneko Agirre
In this position paper, we argue that the classical evaluation on Natural Language Processing (NLP) tasks using annotated benchmarks is in trouble.
1 code implementation • 24 Oct 2023 • Iker García-Ferrero, Begoña Altuna, Javier Álvez, Itziar Gonzalez-Dios, German Rigau
We have used our dataset with the largest available open LLMs in a zero-shot approach to grasp their generalization and inference capability and we have also fine-tuned some of the models to assess whether the understanding of negation can be trained.
1 code implementation • 5 Oct 2023 • Oscar Sainz, Iker García-Ferrero, Rodrigo Agerri, Oier Lopez de Lacalle, German Rigau, Eneko Agirre
In this paper, we propose GoLLIE (Guideline-following Large Language Model for IE), a model able to improve zero-shot results on unseen IE tasks by virtue of being fine-tuned to comply with annotation guidelines.
Ranked #1 on Zero-shot Named Entity Recognition (NER) on HarveyNER (using extra training data)
1 code implementation • 20 Apr 2023 • Iker García-Ferrero, Jon Ander Campos, Oscar Sainz, Ander Salaberria, Dan Roth
Named Entity Recognition (NER) is a core natural language processing task in which pre-trained language models have shown remarkable performance.
Multilingual Named Entity Recognition named-entity-recognition +4
2 code implementations • 20 Dec 2022 • Iker García-Ferrero, Rodrigo Agerri, German Rigau
In the absence of readily available labeled data for a given sequence labeling task and language, annotation projection has been proposed as one of the possible strategies to automatically generate annotated data.
Ranked #1 on Cross-Lingual NER on MasakhaNER2.0 (Hausa metric)
4 code implementations • 23 Oct 2022 • Iker García-Ferrero, Rodrigo Agerri, German Rigau
Zero-resource cross-lingual transfer approaches aim to apply supervised models from a source language to unlabelled target languages.
Ranked #1 on Cross-Lingual NER on CoNLL Spanish
2 code implementations • 17 Jan 2020 • Iker García-Ferrero, Rodrigo Agerri, German Rigau
This paper presents a new technique for creating monolingual and cross-lingual meta-embeddings.