1 code implementation • 28 Mar 2024 • Manuel Tonneau, Pedro Vitor Quinta de Castro, Karim Lasri, Ibrahim Farouq, Lakshminarayanan Subramanian, Victor Orozco-Olvera, Samuel Fraiberger
To address the global issue of hateful content proliferating in online platforms, hate speech detection (HSD) models are typically developed on datasets collected in the United States, thereby failing to generalize to English dialects from the Majority World.
no code implementations • 8 Nov 2022 • Karim Lasri, Alessandro Lenci, Thierry Poibeau
We find that the necessity of position information increases with the amount of masking, and that masked language models without position encodings are not able to reconstruct this information on the task.
no code implementations • 6 Oct 2022 • Dieuwke Hupkes, Mario Giulianelli, Verna Dankers, Mikel Artetxe, Yanai Elazar, Tiago Pimentel, Christos Christodoulopoulos, Karim Lasri, Naomi Saphra, Arabella Sinclair, Dennis Ulmer, Florian Schottmann, Khuyagbaatar Batsuren, Kaiser Sun, Koustuv Sinha, Leila Khalatbari, Maria Ryskina, Rita Frieske, Ryan Cotterell, Zhijing Jin
We present a taxonomy for characterising and understanding generalisation research in NLP.
no code implementations • COLING 2022 • Karim Lasri, Olga Seminck, Alessandro Lenci, Thierry Poibeau
We compare the performance of BERT-base to that of humans, obtained with a psycholinguistic online crowdsourcing experiment.
no code implementations • ACL 2022 • Karim Lasri, Tiago Pimentel, Alessandro Lenci, Thierry Poibeau, Ryan Cotterell
We also find that BERT uses a separate encoding of grammatical number for nouns and verbs.
no code implementations • Findings (ACL) 2022 • Karim Lasri, Alessandro Lenci, Thierry Poibeau
Although transformer-based Neural Language Models demonstrate impressive performance on a variety of tasks, their generalization abilities are not well understood.