Search Results for author: Karën Fort

Found 11 papers, 0 papers with code

Use of a Citizen Science Platform for the Creation of a Language Resource to Study Bias in Language Models for French: A Case Study

no code implementations NIDCP (LREC) 2022 Karën Fort, Aurélie Névéol, Yoann Dupont, Julien Bezançon

We created three tasks on the LanguageARC citizen science platform to assist with the translation of an existing resource from English into French as well as the collection of complementary resources in native French.

Fairness Translation

CLISTER : A Corpus for Semantic Textual Similarity in French Clinical Narratives

no code implementations LREC 2022 Nicolas Hiebel, Olivier Ferret, Karën Fort, Aurélie Névéol

We introduce a definition of similarity that is guided by clinical facts and apply it to the development of a new French corpus of 1, 000 sentence pairs manually annotated according to similarity scores.

Semantic Similarity Semantic Textual Similarity +4

Quantification Annotation in ISO 24617-12, Second Draft

no code implementations LREC 2022 Harry Bunt, Maxime Amblard, Johan Bos, Karën Fort, Bruno Guillaume, Philippe de Groote, Chuyuan Li, Pierre Ludmann, Michel Musiol, Siyana Pavlova, Guy Perrier, Sylvain Pogodalla

This paper describes the continuation of a project that aims at establishing an interoperable annotation schema for quantification phenomena as part of the ISO suite of standards for semantic annotation, known as the Semantic Annotation Framework.

Do we Name the Languages we Study? The #BenderRule in LREC and ACL articles

no code implementations LREC 2022 Fanny Ducel, Karën Fort, Gaël Lejeune, Yves Lepage

For this purpose, we created a corpus from LREC and ACL articles from the above-mentioned periods, from which we manually annotated nearly 1, 000.

French CrowS-Pairs: Extending a challenge dataset for measuring social bias in masked language models to a language other than English

no code implementations ACL 2022 Aurélie Névéol, Yoann Dupont, Julien Bezançon, Karën Fort

We build on the US-centered CrowS-pairs dataset to create a multilingual stereotypes dataset that allows for comparability across languages while also characterizing biases that are specific to each country and language.

Investigating Dominant Word Order on Universal Dependencies with Graph Rewriting

no code implementations RANLP 2021 Hee-Soo Choi, Bruno Guillaume, Karën Fort, Guy Perrier

This paper details experiments we performed on the Universal Dependencies 2. 7 corpora in order to investigate the dominant word order in the available languages.

Cannot find the paper you are looking for? You can Submit a new open access paper.