no code implementations • LT4HALA (LREC) 2022 • Manuel Favaro, Elisa Guadagnini, Eva Sassolini, Marco Biffi, Simonetta Montemagni
In this paper we describe some experiments related to a corpus derived from an authoritative historical Italian dictionary, namely the Grande dizionario della lingua italiana (‘Great Dictionary of the Italian Language’, GDLI for short).
no code implementations • ParlaCLARIN (LREC) 2022 • Tommaso Agnoloni, Roberto Bartolini, Francesca Frontini, Simonetta Montemagni, Carlo Marchetti, Valeria Quochi, Manuela Ruisi, Giulia Venturi
The corpus contains 1199 sessions and 79,373 speeches, for a total of about 31 million words, and was encoded in the ParlaCLARIN TEI XML format as well as in CoNLL-UD format.
no code implementations • LREC 2020 • Federico Boschetti, Irene De Felice, Stefano Dei Rossi, Felice Dell'Orletta, Michele Di Giorgio, Martina Miliani, Lucia C. Passaro, Angelica Puddu, Giulia Venturi, Nicola Labanca, Alessandro Lenci, Simonetta Montemagni
"Voices of the Great War" is the first large corpus of Italian historical texts dating back to the period of the First World War.
no code implementations • LREC 2020 • Dominique Brunato, Andrea Cimino, Felice Dell'Orletta, Giulia Venturi, Simonetta Montemagni
In this paper, we introduce Profiling-UD, a new text analysis tool inspired by the principles of linguistic profiling that can support language variation research from different perspectives.
no code implementations • WS 2018 • Joakim Nivre, Paola Marongiu, Filip Ginter, Jenna Kanerva, Simonetta Montemagni, Sebastian Schuster, Maria Simi
We evaluate two cross-lingual techniques for adding enhanced dependencies to existing treebanks in Universal Dependencies.
no code implementations • WS 2018 • Chiara Alzetta, Felice Dell'Orletta, Simonetta Montemagni, Maria Simi, Giulia Venturi
For both evaluation datasets, the performance of parsers increases, in terms of the standard LAS and UAS measures and of a more focused measure taking into account only relations involved in error patterns, and at the level of individual dependencies.
no code implementations • LREC 2016 • Martijn Wieling, Eva Sassolini, Sebastiana Cucurullo, Simonetta Montemagni
In this paper, we illustrate the integration of an online dialectometric tool, Gabmap, together with an online dialect atlas, the Atlante Lessicale Toscano (ALT-Web).
no code implementations • LREC 2016 • Alessia Barbagli, Pietro Lucisano, Felice Dell'Orletta, Simonetta Montemagni, Giulia Venturi
In this paper, we present the CItA corpus (Corpus Italiano di Apprendenti L1), a collection of essays written by Italian L1 learners during the first and second years of lower secondary school.
no code implementations • LREC 2014 • Maria Simi, Cristina Bosco, Simonetta Montemagni
This is done by comparing the performance of a statistical parser (DeSR) trained on a simpler resource (the augmented version of the Merged Italian Dependency Treebank or MIDT+) and whose output was automatically converted to SD, with the results of the parser directly trained on ISDT.
no code implementations • LREC 2014 • Felice Dell'Orletta, Giulia Venturi, Andrea Cimino, Simonetta Montemagni
In this paper, we present T2K^2, a suite of tools for automatically extracting domain-specific knowledge from collections of Italian and English texts.
no code implementations • LREC 2012 • Alessandro Lenci, Simonetta Montemagni, Giulia Venturi, Maria Grazia Cutrullà
The paper describes the design and the results of a manual annotation methodology devoted to enriching the ISST-TANL Corpus, derived from the Italian Syntactic-Semantic Treebank (ISST), with Semantic Frames information.