no code implementations • LREC (MWE) 2022 • Yagmur Ozturk, Najet Hadj Mohamed, Adam Lion-Bouton, Agata Savary
We provide an overview of the problems observed in the morphosyntactic annotation of the Turkish PARSEME corpus.
1 code implementation • COLING (MWE) 2020 • Agata Savary, Jakub Waszczuk
This paper describes a manually annotated corpus of verbal multi-word expressions in Polish.
no code implementations • COLING 2022 • Adam Lion-Bouton, Yagmur Ozturk, Agata Savary, Jean-Yves Antoine
We apply the validated measures to annotations in 14 languages produced by systems during the PARSEME shared task on automatic identification of multiword expressions and on the gold versions of the corpora.
no code implementations • COLING (MWE) 2020 • Caroline Pasquer, Agata Savary, Carlos Ramisch, Jean-Yves Antoine
We describe the Seen2Unseen system that participated in edition 1.2 of the PARSEME shared task on automatic identification of verbal multiword expressions (VMWEs).
no code implementations • COLING (MWE) 2020 • Carlos Ramisch, Agata Savary, Bruno Guillaume, Jakub Waszczuk, Marie Candito, Ashwini Vaidya, Verginica Barbu Mititelu, Archna Bhatia, Uxoa Iñurrieta, Voula Giouli, Tunga Güngör, Menghan Jiang, Timm Lichte, Chaya Liebeskind, Johanna Monti, Renata Ramisch, Sara Stymne, Abigail Walsh, Hongzhi Xu
We present edition 1.2 of the PARSEME shared task on identification of verbal multiword expressions (VMWEs).
no code implementations • LREC 2022 • Najet Hadj Mohamed, Cherifa Ben Khelil, Agata Savary, Iskandar Keskes, Jean-Yves Antoine, Lamia Hadrich-Belguith
This paper describes our efforts to extend the PARSEME framework to Modern Standard Arabic.
no code implementations • JEP/TALN/RECITAL 2022 • Najet Hadj Mohamed, Cherifa Ben Khelil, Agata Savary, Iskander Keskes, Jean-Yves Antoine, Lamia Hadrich-Belguith
This paper describes our efforts to extend the PARSEME project to Modern Standard Arabic.
no code implementations • 14 Jan 2025 • Louis Estève, Manon Scholivet, Agata Savary
Diversity is an important property of datasets and sampling data for diversity is useful in dataset creation.
no code implementations • COLING 2020 • Caroline Pasquer, Agata Savary, Carlos Ramisch, Jean-Yves Antoine
Automatic identification of multiword expressions (MWEs), like 'to cut corners' (to do an incomplete job), is a pre-requisite for semantically-oriented downstream applications.
no code implementations • 22 Jul 2020 • Caroline Pasquer, Agata Savary, Jean-Yves Antoine, Carlos Ramisch, Nicolas Labroche, Arnaud Giacometti
We use this fact to determine the optimal set of features which could be used in a supervised classification setting to solve a subproblem of VMWE identification: the identification of occurrences of previously seen VMWEs.
no code implementations • JEPTALNRECITAL 2020 • Anne-Lyse Minard, Andréane Roques, Nicolas Hiot, Mirian Halfeld Ferrari Alves, Agata Savary
This paper presents the system developed by the DOING team for the DEFT 2020 evaluation campaign on semantic similarity and fine-grained information extraction.
no code implementations • WS 2019 • Agata Savary, Silvio Cordeiro, Carlos Ramisch
Because most multiword expressions (MWEs), especially verbal ones, are semantically non-compositional, their automatic identification in running text is a prerequisite for semantically-oriented downstream applications.
no code implementations • JEPTALNRECITAL 2019 • Marine Schmitt, Elise Moreau, Mathieu Constant, Agata Savary
We present the online demonstrator of the ANR PARSEME-FR project, dedicated to multiword expressions.
no code implementations • 23 Oct 2018 • Agata Savary, Simon Petitjean, Timm Lichte, Laura Kallmeyer, Jakub Waszczuk
Multiword expressions (MWEs) exhibit both regular and idiosyncratic properties.
no code implementations • COLING 2018 • Carlos Ramisch, Silvio Ricardo Cordeiro, Agata Savary, Veronika Vincze, Verginica Barbu Mititelu, Archna Bhatia, Maja Buljan, Marie Candito, Polona Gantar, Voula Giouli, Tunga Güngör, Abdelati Hawwari, Uxoa Iñurrieta, Jolanta Kovalevskaitė, Simon Krek, Timm Lichte, Chaya Liebeskind, Johanna Monti, Carla Parra Escartín, Behrang Qasemizadeh, Renata Ramisch, Nathan Schneider, Ivelina Stoyanova, Ashwini Vaidya, Abigail Walsh
Corpora were created for 20 languages, which are also briefly discussed.
no code implementations • COLING 2018 • Caroline Pasquer, Carlos Ramisch, Agata Savary, Jean-Yves Antoine
We describe the VarIDE system (standing for Variant IDEntification), which participated in edition 1.1 of the PARSEME shared task on automatic identification of verbal multiword expressions (VMWEs).
no code implementations • COLING 2018 • Caroline Pasquer, Agata Savary, Carlos Ramisch, Jean-Yves Antoine
Multiword expressions, especially verbal ones (VMWEs), show idiosyncratic variability, which is challenging for NLP applications, hence the need for VMWE identification.
no code implementations • NAACL 2018 • Caroline Pasquer, Agata Savary, Jean-Yves Antoine, Carlos Ramisch
One of the most outstanding properties of multiword expressions (MWEs), especially verbal ones (VMWEs), important both in theoretical models and applications, is their idiosyncratic variability.
no code implementations • JEPTALNRECITAL 2017 • Marie Candito, Mathieu Constant, Carlos Ramisch, Agata Savary, Yannick Parmentier, Caroline Pasquer, Jean-Yves Antoine
We describe the French part of the data produced within the multilingual PARSEME campaign on the identification of verbal multiword expressions (Savary et al., 2017).
no code implementations • WS 2017 • Agata Savary, Jakub Waszczuk
Multiword expressions (MWEs) are linguistic objects containing two or more words and showing idiosyncratic behavior at different levels.
no code implementations • WS 2017 • Agata Savary, Carlos Ramisch, Silvio Cordeiro, Federico Sangati, Veronika Vincze, Behrang Qasemizadeh, Marie Candito, Fabienne Cap, Voula Giouli, Ivelina Stoyanova, Antoine Doucet
This paper presents the corpus annotation methodology and outcome, the shared task organisation and the results of the participating systems.
no code implementations • COLING 2016 • Jakub Waszczuk, Agata Savary, Yannick Parmentier
Multiword expressions (MWEs) are pervasive in natural languages and often have both idiomatic and compositional readings, which leads to high syntactic ambiguity.
no code implementations • LREC 2016 • Victoria Rosén, Koenraad De Smedt, Gyri Smørdal Losnegaard, Eduard Bejček, Agata Savary, Petya Osenova
The comparison is focused on the annotation of light verb constructions and verbal idioms.
no code implementations • LREC 2016 • Anaïs Lefeuvre-Halftermeyer, Jean-Yves Antoine, Alain Couillault, Emmanuel Schang, Lotfi Abouda, Agata Savary, Denis Maurel, Iris Eshkol, Delphine Battistelli
This paper reports a critical analysis of the ISO TimeML standard, in the light of several experiences of temporal annotation that were conducted on spoken French.
no code implementations • LREC 2016 • Gyri Smørdal Losnegaard, Federico Sangati, Carla Parra Escartín, Agata Savary, Sascha Bargmann, Johanna Monti
We also discuss the problems we have detected upon examination of the data as well as possible ways of enhancing the survey.
no code implementations • LREC 2016 • Diana Bogantes, Eric Rodríguez, Alejandro Arauco, Alejandro Rodríguez, Agata Savary
This paper describes a pilot study in lexical encoding of multi-word expressions (MWEs) in 4 Latin American dialects of Spanish: Costa Rican, Colombian, Mexican and Peruvian.
no code implementations • JEPTALNRECITAL 2014 • Anaïs Lefeuvre, Jean-Yves Antoine, Agata Savary, Emmanuel Schang, Lotfi Abouda, Denis Maurel, Iris Eshkol
no code implementations • LREC 2014 • Maciej Ogrodniczuk, Mateusz Kopeć, Agata Savary
Correlation between cluster and mention count within a text is investigated, with short characteristics of outlier cases.