no code implementations • WS 2019 • Jan {\v{S}}najder, Tamara Sladoljev-Agejev, Svjetlana Koli{\'c} Vehovec
We present a model for automatic scoring of coherence based on comparing the rhetorical structure (RS) of college student summaries in L2 (English) against expert summaries.
no code implementations • WS 2019 • Antonio {\v{S}}ajatovi{\'c}, Maja Buljan, Jan {\v{S}}najder, Bojana Dalbelo Ba{\v{s}}i{\'c}
Automatic Term Extraction (ATE) extracts terminology from domain-specific corpora.
no code implementations • WS 2019 • Mladen Karan, Jan {\v{S}}najder
We address the task of automatically detecting toxic content in user generated texts.
no code implementations • SEMEVAL 2019 • Niko Pali{\'c}, Juraj Vladika, Dominik {\v{C}}ubeli{\'c}, Ivan Lovren{\v{c}}i{\'c}, Maja Buljan, Jan {\v{S}}najder
In this paper, we demonstrate the system built to solve the SemEval-2019 task 4: Hyperpartisan News Detection (Kiesel et al., 2019), the task of automatically determining whether an article is heavily biased towards one side of the political spectrum.
no code implementations • WS 2018 • Mladen Karan, Jan {\v{S}}najder
We investigate to what extent the models trained to detect general abusive language generalize between different datasets labeled with different abusive language types.
no code implementations • WS 2018 • Ivan Sekulic, Matej Gjurkovi{\'c}, Jan {\v{S}}najder
Bipolar disorder, an illness characterized by manic and depressive episodes, affects more than 60 million people worldwide.
no code implementations • COLING 2018 • Viktor Golem, Mladen Karan, Jan {\v{S}}najder
The task, however, is far from being trivial, as what is considered as aggressive speech can be quite subjective, and the task is further complicated by the noisy nature of user-generated text on social networks.
no code implementations • WS 2018 • Matej Gjurkovi{\'c}, Jan {\v{S}}najder
Automated personality prediction from social media is gaining increasing attention in natural language processing and social sciences communities.
no code implementations • NAACL 2018 • Maja Buljan, Sebastian Pad{\'o}, Jan {\v{S}}najder
LexSub is a more natural task, enables us to evaluate meaning composition at the level of individual words, and provides a common ground to compare CDSMs with dedicated LexSub models.
no code implementations • SEMEVAL 2018 • Ana Brassard, Tin Kuculo, Filip Boltu{\v{z}}i{\'c}, Jan {\v{S}}najder
This paper describes our system for the SemEval-2018 Task 12: Argument Reasoning Comprehension Task.
no code implementations • SEMEVAL 2018 • Martin Gluhak, Maria Pia di Buono, Abbas Akkasi, Jan {\v{S}}najder
We describe two systems for semantic relation classification with which we participated in the SemEval 2018 Task 7, subtask 1 on semantic relation classification: an SVM model and a CNN model.
no code implementations • IJCNLP 2017 • Tamara Sladoljev-Agejev, Jan {\v{S}}najder
Assessing summaries is a demanding, yet useful task which provides valuable information on language competence, especially for second language learners.
no code implementations • WS 2017 • Maria Pia di Buono, Jan {\v{S}}najder, Bojana Dalbelo Ba{\v{s}}i{\'c}, Goran Glava{\v{s}}, Martin Tutek, Natasa Milic-Frayling
We present a preliminary study on predicting news values from headline text and emotions.
no code implementations • WS 2017 • Filip Boltu{\v{z}}i{\'c}, Jan {\v{S}}najder
Claims are the building blocks of arguments and the reasons underpinning opinions, thus analyzing claims is important for both argumentation mining and opinion mining.
no code implementations • SEMEVAL 2017 • Zoran Medi{\'c}, Jan {\v{S}}najder, Sebastian Pad{\'o}
The Practical Lexical Function (PLF) model is a model of computational distributional semantics that attempts to strike a balance between expressivity and learnability in predicting phrase meaning and shows competitive results.
no code implementations • SEMEVAL 2017 • David Lozi{\'c}, Doria {\v{S}}ari{\'c}, Ivan Toki{\'c}, Zoran Medi{\'c}, Jan {\v{S}}najder
This paper describes the system we submitted to SemEval-2017 Task 4 (Sentiment Analysis in Twitter), specifically subtasks A, B, and D. Our main focus was topic-based message polarity classification on a two-point scale (subtask B).
no code implementations • SEMEVAL 2017 • Filip {\v{S}}aina, Toni Kukurin, Lukrecija Pulji{\'c}, Mladen Karan, Jan {\v{S}}najder
We use features based on different semantic similarity models (e. g., Latent Dirichlet Allocation), as well as features based on several types of pre-trained word embeddings.
no code implementations • SEMEVAL 2017 • Leon Rotim, Martin Tutek, Jan {\v{S}}najder
Our best-performing submission scored 3rd on the task out of 29 teams and 4th out of 45 submissions with a cosine score of 0. 733.
no code implementations • SEMEVAL 2017 • Marin Kukova{\v{c}}ec, Juraj Malenica, Ivan Mr{\v{s}}i{\'c}, Antonio {\v{S}}ajatovi{\'c}, Domagoj Alagi{\'c}, Jan {\v{S}}najder
This paper describes our system for humor ranking in tweets within the SemEval 2017 Task 6: {\#}HashtagWars (6A and 6B).
no code implementations • WS 2017 • Leon Rotim, Jan {\v{S}}najder
We focus on the task of supervised sentiment classification of short and informal texts in Croatian, using two simple yet effective methods: word embeddings and string kernels.
no code implementations • WS 2017 • Maria Pia di Buono, Martin Tutek, Jan {\v{S}}najder, Goran Glava{\v{s}}, Bojana Dalbelo Ba{\v{s}}i{\'c}, Nata{\v{s}}a Mili{\'c}-Frayling
In this paper, we describe our preliminary study on annotating event mention as a part of our research on high-precision news event extraction models.
no code implementations • WS 2017 • Jakub Piskorski, Lidia Pivovarova, Jan {\v{S}}najder, Josef Steinberger, Roman Yangarber
The reported evaluation figures reflect the relatively higher level of complexity of named entity-related tasks in the context of processing texts in Slavic languages.
no code implementations • WS 2017 • Paula Gombar, Zoran Medi{\'c}, Domagoj Alagi{\'c}, Jan {\v{S}}najder
We experiment with the graph-based acquisition of sentiment lexicons, analyze their quality, and investigate how effectively they can be used in sentiment classification.
no code implementations • WS 2017 • Domagoj Alagi{\'c}, Jan {\v{S}}najder
Lexical substitution is a task of determining a meaning-preserving replacement for a word in context.
no code implementations • WS 2017 • Maja Buljan, Jan {\v{S}}najder
As multiword expressions (MWEs) exhibit a range of idiosyncrasies, their automatic detection warrants the use of many different features.
no code implementations • TACL 2017 • Julian Brooke, Jan {\v{S}}najder, Timothy Baldwin
We present a new model for acquiring comprehensive multiword lexicons from large corpora based on competition among n-gram candidates.
no code implementations • COLING 2016 • Sebastian Pad{\'o}, Aur{\'e}lie Herbelot, Max Kisselew, Jan {\v{S}}najder
Compositional distributional semantic models (CDSMs) have successfully been applied to the task of predicting the meaning of a range of linguistic constructions.
no code implementations • SEMEVAL 2016 • Martin Tutek, Ivan Sekuli{\'c}, Paula Gombar, Ivan Paljak, Filip {\v{C}}ulinovi{\'c}, Filip Boltu{\v{z}}i{\'c}, Mladen Karan, Domagoj Alagi{\'c}, Jan {\v{S}}najder
no code implementations • LREC 2016 • Ivan Sekuli{\'c}, Jan {\v{S}}najder
In this paper we describe VerbCROcean, a broad-coverage repository of fine-grained semantic relations between Croatian verbs.
no code implementations • LREC 2016 • Domagoj Alagi{\'c}, Jan {\v{S}}najder
We introduce Cro36WSD, a freely-available medium-sized lexical sample for Croatian word sense disambiguation (WSD). Cro36WSD comprises 36 words: 12 adjectives, 12 nouns, and 12 verbs, balanced across both frequency bands and polysemy levels.
no code implementations • LREC 2016 • Marko Bekavac, Jan {\v{S}}najder
Word sense induction (WSI) seeks to induce senses of words from unannotated corpora.
no code implementations • LREC 2014 • Jan {\v{S}}najder
We describe an evaluation methodology based on manually constructed derivational families from which we sample and annotate pairs of lemmas.
no code implementations • LREC 2014 • Goran Glava{\v{s}}, Jan {\v{S}}najder, Marie-Francine Moens, Parisa Kordjamshidi
In this work, we present HiEve, a corpus for recognizing relations of spatiotemporal containment between events.
no code implementations • LREC 2012 • Mladen Karan, Jan {\v{S}}najder, Bojana Dalbelo Ba{\v{s}}i{\'c}
Features which contributed the most to overall performance were PMI, semantic relatedness, and POS information.