1 code implementation • 28 Mar 2024 • Janis Goldzycher, Paul Röttger, Gerold Schneider
Our experiments show that the resulting dataset is challenging even for state-of-the-art hate speech detection models, and that training on GAHD clearly improves model robustness.
no code implementations • 4 Nov 2023 • Patricia Ronan, Gerold Schneider
It proved surprisingly successful in the task of broad phonetic transcription, but performed less well in the analysis of morphemes and phrases.
no code implementations • 27 Jul 2023 • Ahmet Yavuz Uluslu, Gerold Schneider
In this paper, we present the first application of Native Language Identification (NLI) for the Turkish language.
1 code implementation • 6 Jun 2023 • Janis Goldzycher, Moritz Preisig, Chantal Amrhein, Gerold Schneider
In this paper, we test whether natural language inference (NLI) models which perform well in zero- and few-shot settings can benefit hate speech detection performance in scenarios where only a limited amount of labeled data is available in the target language.
no code implementations • 6 Jun 2023 • Ahmet Yavuz Uluslu, Gerold Schneider
This paper presents the first comprehensive study on automatic readability assessment of Turkish texts.
no code implementations • 18 Nov 2022 • Ahmet Yavuz Uluslu, Gerold Schneider
Native language identification (NLI) is the task of automatically identifying the native language (L1) of an individual based on their language production in a learned language.
1 code implementation • TRAC (COLING) 2022 • Janis Goldzycher, Gerold Schneider
We find that the zero-shot baseline used for the initial error analysis already outperforms commercial systems and fine-tuned BERT-based hate speech detection models on HateCheck.
no code implementations • LREC 2020 • Johannes Gra{\"e}n, David Alfter, Gerold Schneider
The Common European Framework of Reference for Languages (CEFR) defines six levels of learner proficiency, and links them to particular communicative abilities.
no code implementations • LREC 2012 • Gerold Schneider, Fabio Rinaldi, Simon Clematide
We give an overview of our approach to the extraction of interactions between pharmacogenomic entities like drugs, genes and diseases and suggest classes of interaction types driven by data from PharmGKB and partly following the top level ontology WordNet and biomedical types from BioNLP.