1 code implementation • EMNLP (NLP-COVID19) 2020 • Abdullatif Köksal, Hilal Dönmez, Rıza Özçelik, Elif Ozkirimli, Arzucan Özgür
Coronavirus Disease of 2019 (COVID-19) created dire consequences globally and triggered an intense scientific effort from different domains.
no code implementations • 28 Nov 2024 • Marion Thaler, Abdullatif Köksal, Alina Leidinger, Anna Korhonen, Hinrich Schütze
Our findings reveal that biases present in pre-training data are amplified in model outputs.
no code implementations • 16 Oct 2024 • Mete Ismayilzada, Defne Circi, Jonne Sälevä, Hale Sirin, Abdullatif Köksal, Bhuwan Dhingra, Antoine Bosselut, Lonneke van der Plas, Duygu Ataman
While humans exhibit compositional generalization and linguistic creativity in language use, the extent to which LLMs replicate these abilities, particularly in morphology, is under-explored.
1 code implementation • 19 Sep 2024 • Abdullatif Köksal, Marion Thaler, Ayyoob Imani, Ahmet Üstün, Anna Korhonen, Hinrich Schütze
Instruction tuning enhances large language models (LLMs) by aligning them with human preferences across diverse tasks.
1 code implementation • 3 Sep 2024 • Ingo Ziegler, Abdullatif Köksal, Desmond Elliott, Hinrich Schütze
Building high-quality datasets for specialized tasks is a time-consuming and resource-intensive process that often requires specialized domain knowledge.
1 code implementation • 30 Aug 2024 • Raoyuan Zhao, Abdullatif Köksal, Yihong Liu, Leonie Weissweiler, Anna Korhonen, Hinrich Schütze
In this work, we propose SYNTHEVAL, a hybrid behavioral testing framework that leverages large language models (LLMs) to generate a wide range of test types for a comprehensive evaluation of NLP models.
1 code implementation • 17 Jul 2024 • Arda Yüksel, Abdullatif Köksal, Lütfi Kerem Şenel, Anna Korhonen, Hinrich Schütze
These questions are written by curriculum experts, suitable for the high-school curricula in Turkey, covering subjects ranging from natural sciences and math questions to more culturally representative topics such as Turkish Literature and the history of the Turkish Republic.
1 code implementation • 9 Jul 2024 • Ali Modarressi, Abdullatif Köksal, Hinrich Schütze
We first demonstrate that models trained on factual data exhibit inconsistent behavior: while they accurately extract triples from factual data, they fail to extract the same triples after counterfactual modification.
no code implementations • 17 Apr 2024 • Ali Modarressi, Abdullatif Köksal, Ayyoob Imani, Mohsen Fayyaz, Hinrich Schütze
While current large language models (LLMs) demonstrate some capabilities in knowledge-intensive tasks, they are limited by relying on their parameters as an implicit storage mechanism.
no code implementations • 11 Mar 2024 • Leonie Weissweiler, Abdullatif Köksal, Hinrich Schütze
Argument Structure Constructions (ASCs) are one of the most well-studied construction groups, providing a unique opportunity to demonstrate the usefulness of Construction Grammar (CxG).
no code implementations • 13 Nov 2023 • Abdullatif Köksal, Renat Aksitov, Chung-Ching Chang
For open book QA as a case study, we demonstrate that models finetuned with our counterfactual datasets improve text grounding, leading to better open book QA performance, with up to an 8. 0% increase in F1 score.
1 code implementation • 22 May 2023 • Abdullatif Köksal, Omer Faruk Yalcin, Ahmet Akbiyik, M. Tahir Kilavuz, Anna Korhonen, Hinrich Schütze
For nationality as a case study, we show that LABDet `surfaces' nationality bias by training a classifier on top of a frozen PLM on non-nationality sentiment detection.
2 code implementations • 17 Apr 2023 • Abdullatif Köksal, Timo Schick, Anna Korhonen, Hinrich Schütze
We generate instructions via LLMs for human-written corpus examples using reverse instructions.
no code implementations • 4 Apr 2023 • Antonis Maronikolakis, Abdullatif Köksal, Hinrich Schütze
We introduce HATELEXICON, a lexicon of slurs and targets of hate speech for the countries of Brazil, Germany, India and Kenya, to aid training and interpretability of models.
1 code implementation • 15 Nov 2022 • Abdullatif Köksal, Timo Schick, Hinrich Schütze
Few-shot classification has made great strides due to foundation models that, through priming and prompting, are highly effective few-shot learners.
no code implementations • 24 Oct 2022 • Leonie Weissweiler, Valentin Hofmann, Abdullatif Köksal, Hinrich Schütze
Construction Grammar (CxG) is a paradigm from cognitive linguistics emphasising the connection between syntax and semantics.
1 code implementation • 12 Oct 2022 • Abdullatif Köksal, Silvia Severini, Hinrich Schütze
Word alignments are essential for a variety of NLP tasks.
2 code implementations • EMNLP 2021 • Yi Huang, Buse Giledereli, Abdullatif Köksal, Arzucan Özgür, Elif Ozkirimli
Here, we introduce the application of balancing loss functions for multi-label text classification.
Ranked #1 on Multi-Label Text Classification on Reuters-21578
1 code implementation • 19 Oct 2020 • Abdullatif Köksal, Arzucan Özgür
Relation classification is one of the key topics in information extraction, which can be used to construct knowledge bases or to provide useful information for question answering.
no code implementations • 5 Sep 2020 • Abdullatif Köksal, Hilal Dönmez, Rıza Özçelik, Elif Ozkirimli, Arzucan Özgür
Coronavirus Disease of 2019 (COVID-19) created dire consequences globally and triggered an intense scientific effort from different domains.
2 code implementations • 24 Feb 2020 • Utku Türk, Furkan Atmaca, Şaziye Betül Özateş, Gözde Berk, Seyyit Talha Bedir, Abdullatif Köksal, Balkız Öztürk Başaran, Tunga Güngör, Arzucan Özgür
In addition, we report the parsing results of a state-of-the-art dependency parser obtained over the BOUN Treebank as well as two other treebanks in Turkish.
Cultural Vocal Bursts Intensity Prediction Dependency Parsing