no code implementations • SIGDIAL (ACL) 2021 • Mladen Karan, Prashant Khare, Patrick Healey, Matthew Purver
This work revisits the task of detecting decision-related utterances in multi-party dialogue.
no code implementations • 11 Nov 2022 • Ravi Shekhar, Mladen Karan, Matthew Purver
In light of unprecedented increases in the popularity of the internet and social media, comment moderation has never been a more relevant task.
1 code implementation • RANLP 2021 • Elaine Zosa, Ravi Shekhar, Mladen Karan, Matthew Purver
Moderation of reader comments is a significant problem for online news platforms.
no code implementations • COLING 2020 • Goran Glava{\v{s}}, Mladen Karan, Ivan Vuli{\'c}
We present XHate-999, a multi-domain and multilingual evaluation data set for abusive language detection.
no code implementations • ACL 2020 • Mladen Karan, Ivan Vuli{\'c}, Anna Korhonen, Goran Glava{\v{s}}
Effective projection-based cross-lingual word embedding (CLWE) induction critically relies on the iterative self-learning procedure.
no code implementations • NAACL (SocialNLP) 2021 • Matej Gjurković, Mladen Karan, Iva Vukojević, Mihaela Bošnjak, Jan Šnajder
Personality and demographics are important variables in social sciences, while in NLP they can aid in interpretability and removal of societal biases.
no code implementations • WS 2019 • Mladen Karan, Jan {\v{S}}najder
We address the task of automatically detecting toxic content in user generated texts.
no code implementations • WS 2019 • Mihaela Bo{\v{s}}njak, Mladen Karan
Nowadays it is becoming more important than ever to find new ways of extracting useful information from the evergrowing amount of user-generated data available online.
no code implementations • WS 2018 • Mladen Karan, Jan {\v{S}}najder
We investigate to what extent the models trained to detect general abusive language generalize between different datasets labeled with different abusive language types.
no code implementations • COLING 2018 • Viktor Golem, Mladen Karan, Jan {\v{S}}najder
The task, however, is far from being trivial, as what is considered as aggressive speech can be quite subjective, and the task is further complicated by the noisy nature of user-generated text on social networks.
no code implementations • SEMEVAL 2017 • Filip {\v{S}}aina, Toni Kukurin, Lukrecija Pulji{\'c}, Mladen Karan, Jan {\v{S}}najder
We use features based on different semantic similarity models (e. g., Latent Dirichlet Allocation), as well as features based on several types of pre-trained word embeddings.
no code implementations • SEMEVAL 2016 • Martin Tutek, Ivan Sekuli{\'c}, Paula Gombar, Ivan Paljak, Filip {\v{C}}ulinovi{\'c}, Filip Boltu{\v{z}}i{\'c}, Mladen Karan, Domagoj Alagi{\'c}, Jan {\v{S}}najder
no code implementations • LREC 2012 • Mladen Karan, Jan {\v{S}}najder, Bojana Dalbelo Ba{\v{s}}i{\'c}
Features which contributed the most to overall performance were PMI, semantic relatedness, and POS information.