no code implementations • LREC 2012 • Jorge Vivaldi, Luis Adri{\'a}n Cabrera-Diego, Gerardo Sierra, Mar{\'\i}a Pozzi
A scientific vocabulary is a set of terms that designate scientific concepts.
no code implementations • 20 Jan 2015 • Gerardo Sierra, Juan-Manuel Torres-Moreno, Alejandro Molina
This article focuses on the description and evaluation of a new unsupervised learning method of clustering of definitions in Spanish according to their semantic.
no code implementations • LREC 2016 • Ximena Gutierrez-Vasques, Gerardo Sierra, Isaac Hern Pompa, ez
This paper describes the project called Axolotl which comprises a Spanish-Nahuatl parallel corpus and its search interface.
no code implementations • 21 Feb 2017 • Carlos-Emiliano González-Gallardo, Juan-Manuel Torres-Moreno, Azucena Montes Rendón, Gerardo Sierra
In this paper we describe a dynamic normalization process applied to social network multilingual documents (Facebook and Twitter) to improve the performance of the Author profiling task for short texts.
no code implementations • 11 Mar 2017 • Juan-Manuel Torres-Moreno, Gerardo Sierra, Peter Peinl
The purpose of this corpus is to automatically assess the similarity between a pair of texts and to evaluate different similarity measures, both for whole documents or for individual sentences.
no code implementations • 17 Oct 2017 • Ignacio Arroyo-Fernández, Carlos-Francisco Méndez-Cruz, Gerardo Sierra, Juan-Manuel Torres-Moreno, Grigori Sidorov
Results showed that our model outperformed the state of the art in well-known Semantic Textual Similarity (STS) benchmarks.
Open-Ended Question Answering Semantic Textual Similarity +3
1 code implementation • COLING 2018 • Manuel Mager, Ximena Gutierrez-Vasques, Gerardo Sierra, Ivan Meza
Indigenous languages of the American continent are highly diverse.
no code implementations • WS 2018 • Alej Dorantes, ro, Gerardo Sierra, Tlauhlia Yam{\'\i}n Donohue P{\'e}rez, Gemma Bel-Enguix, M{\'o}nica Jasso Rosales
This work presents the Sociolinguistic Corpus of WhatsApp Chats in Spanish among College Students, a corpus of raw data for general use.