Search Results for author: Karën Fort

Found 12 papers, 2 papers with code

Investigating Dominant Word Order on Universal Dependencies with Graph Rewriting

no code implementations • RANLP 2021 • Hee-Soo Choi, Bruno Guillaume, Karën Fort, Guy Perrier

This paper details experiments we performed on the Universal Dependencies 2. 7 corpora in order to investigate the dominant word order in the available languages.

Paper
Add Code

FENEC : un corpus équilibré pour l’évaluation des entités nommées en français (FENEC : a balanced sample corpus for French named entity recognition )

1 code implementation • JEP/TALN/RECITAL 2022 • Alice Millour, Yoann Dupont, Alexane Jouglar, Karën Fort

Nous présentons ici FENEC (FrEnch Named-entity Evaluation Corpus), un corpus à échantillons équilibrés contenant six genres, annoté en entités nommées selon le schéma fin Quæro.

named-entity-recognition Named Entity Recognition +1

Paper
Code

Langues par défaut? Analyse contrastive et diachronique des langues non citées dans les articles de TALN et d’ACL (Contrastive and diachronic study of unmentioned (by default ?) languages in TALN and ACL We study the application of the #BenderRule in natural language processing articles, taking into account a contrastive and a diachronic dimensions, by examining the proceedings of two NLP conferences, TALN and ACL, over time)

no code implementations • JEP/TALN/RECITAL 2022 • Fanny Ducel, Karën Fort, Gaël Lejeune, Yves Lepage

Cet article étudie l’application de la #RègledeBender dans des articles de traitement automatique des langues (TAL), en prenant en compte une dimension contrastive, par l’examen des actes de deux conférences du domaine, TALN et ACL, et une dimension diachronique, en examinant ces conférences au fil du temps.

Paper
Add Code

CLISTER : Un corpus pour la similarité sémantique textuelle dans des cas cliniques en français (CLISTER : A Corpus for Semantic Textual Similarity in French Clinical Narratives)

no code implementations • JEP/TALN/RECITAL 2022 • Nicolas Hiebel, Karën Fort, Aurélie Névéol, Olivier Ferret

Le TAL repose sur la disponibilité de corpus annotés pour l’entraînement et l’évaluation de modèles.

Semantic Textual Similarity STS

Paper
Add Code

French CrowS-Pairs: Extension à une langue autre que l’anglais d’un corpus de mesure des biais sociétaux dans les modèles de langue masqués (French CrowS-Pairs : Extending a challenge dataset for measuring social bias in masked language models to a language other than English)

no code implementations • JEP/TALN/RECITAL 2022 • Aurélie Névéol, Yoann Dupont, Julien Bezançon, Karën Fort

Nous montrons que quatre modèles de langue favorisent les énoncés qui expriment des stéréotypes dans la plupart des catégories.

Paper
Add Code

Do we Name the Languages we Study? The #BenderRule in LREC and ACL articles

no code implementations • LREC 2022 • Fanny Ducel, Karën Fort, Gaël Lejeune, Yves Lepage

For this purpose, we created a corpus from LREC and ACL articles from the above-mentioned periods, from which we manually annotated nearly 1, 000.

Paper
Add Code

Use of a Citizen Science Platform for the Creation of a Language Resource to Study Bias in Language Models for French: A Case Study

no code implementations • NIDCP (LREC) 2022 • Karën Fort, Aurélie Névéol, Yoann Dupont, Julien Bezançon

We created three tasks on the LanguageARC citizen science platform to assist with the translation of an existing resource from English into French as well as the collection of complementary resources in native French.

Fairness Translation

Paper
Add Code

Quantification Annotation in ISO 24617-12, Second Draft

no code implementations • LREC 2022 • Harry Bunt, Maxime Amblard, Johan Bos, Karën Fort, Bruno Guillaume, Philippe de Groote, Chuyuan Li, Pierre Ludmann, Michel Musiol, Siyana Pavlova, Guy Perrier, Sylvain Pogodalla

This paper describes the continuation of a project that aims at establishing an interoperable annotation schema for quantification phenomena as part of the ISO suite of standards for semantic annotation, known as the Semantic Annotation Framework.

Paper
Add Code

CLISTER : A Corpus for Semantic Textual Similarity in French Clinical Narratives

no code implementations • LREC 2022 • Nicolas Hiebel, Olivier Ferret, Karën Fort, Aurélie Névéol

We introduce a definition of similarity that is guided by clinical facts and apply it to the development of a new French corpus of 1, 000 sentence pairs manually annotated according to similarity scores.

Paper
Add Code

Corpus-based language universals analysis using Universal Dependencies

no code implementations • Quasy (SyntaxFest) 2021 • Hee-Soo Choi, Bruno Guillaume, Karën Fort

Paper
Add Code

French CrowS-Pairs: Extending a challenge dataset for measuring social bias in masked language models to a language other than English

no code implementations • ACL 2022 • Aurélie Névéol, Yoann Dupont, Julien Bezançon, Karën Fort

We build on the US-centered CrowS-pairs dataset to create a multilingual stereotypes dataset that allows for comparability across languages while also characterizing biases that are specific to each country and language.

Sentence

Paper
Add Code

The Elephant in the Room: Analyzing the Presence of Big Tech in Natural Language Processing Research

1 code implementation • 4 May 2023 • Mohamed Abdalla, Jan Philip Wahle, Terry Ruas, Aurélie Névéol, Fanny Ducel, Saif M. Mohammad, Karën Fort

Recent advances in deep learning methods for natural language processing (NLP) have created new business opportunities and made NLP research critical for industry development.

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.