no code implementations • VarDial (COLING) 2020 • Çağrı Çöltekin
This paper describes a set of experiments for discriminating between two closely related language varieties, Moldavian and Romanian, under a substantial domain shift.
no code implementations • NAACL (CMCL) 2021 • Alisan Balkoca, Abdullah Algan, Cengiz Acarturk, Çağrı Çöltekin
This system description paper describes our participation in CMCL 2021 shared task on predicting human reading patterns.
no code implementations • RANLP 2021 • Mihai Manolescu, Çağrı Çöltekin
This paper describes the annotation process of an offensive language data set for Romanian on social media.
no code implementations • UDW (COLING) 2020 • Çağrı Çöltekin
As in any field of inquiry that depends on experiments, the verifiability of experimental studies is important in computational linguistics.
no code implementations • ParlaCLARIN (LREC) 2022 • Maciej Ogrodniczuk, Petya Osenova, Tomaž Erjavec, Darja Fišer, Nikola Ljubešić, Çağrı Çöltekin, Matyáš Kopp, Meden Katja
In ParlaMint I, a CLARIN-ERIC supported project in pandemic times, a set of comparable and uniformly annotated multilingual corpora for 17 national parliaments were developed and released in 2021.
no code implementations • ParlaCLARIN (LREC) 2022 • Gül M. Kurtoğlu Eskişar, Çağrı Çöltekin
We present the initial results of our quantitative study on emotions (Anger, Disgust, Fear, Happiness, Sadness and Surprise) in Turkish parliament (2011–2021).
1 code implementation • LREC 2022 • Diana Constantina Hoefels, Çağrı Çöltekin, Irina Diana Mădroane
This paper introduces CoRoSeOf, a large corpus of Romanian social media manually annotated for sexist and offensive language.
no code implementations • 31 May 2023 • Noëmi Aepli, Çağrı Çöltekin, Rob van der Goot, Tommi Jauhiainen, Mourhaf Kazzaz, Nikola Ljubešić, Kai North, Barbara Plank, Yves Scherrer, Marcos Zampieri
This report presents the results of the shared tasks organized as part of the VarDial Evaluation Campaign 2023.
1 code implementation • 11 Apr 2022 • Çağrı Çöltekin, Taraka Rama
We present an analysis of eight measures used for quantifying morphological complexity of natural languages.
no code implementations • 11 Apr 2022 • Çağrı Çöltekin, A. Seza Doğruöz, Özlem Çetinoğlu
This paper presents a comprehensive survey of corpora and lexical resources available for Turkish.
no code implementations • SEMEVAL 2020 • Marcos Zampieri, Preslav Nakov, Sara Rosenthal, Pepa Atanasova, Georgi Karadzhov, Hamdy Mubarak, Leon Derczynski, Zeses Pitenis, Çağrı Çöltekin
We present the results and main findings of SemEval-2020 Task 12 on Multilingual Offensive Language Identification in Social Media (OffensEval 2020).
no code implementations • 13 Sep 2018 • Çağrı Çöltekin, Taraka Rama
We experimented with a number of different models, including recurrent and convolutional networks, Poisson regression, support vector regression, and L1 and L2 regularized linear regression.