Search Results for author: Jan {\v{S}}najder

Found 54 papers, 0 papers with code

Analysing Rhetorical Structure as a Key Feature of Summary Coherence

no code implementations • WS 2019 • Jan {\v{S}}najder, Tamara Sladoljev-Agejev, Svjetlana Koli{\'c} Vehovec

We present a model for automatic scoring of coherence based on comparing the rhetorical structure (RS) of college student summaries in L2 (English) against expert summaries.

regression Relation

Paper
Add Code

Evaluating Automatic Term Extraction Methods on Individual Documents

no code implementations • WS 2019 • Antonio {\v{S}}ajatovi{\'c}, Maja Buljan, Jan {\v{S}}najder, Bojana Dalbelo Ba{\v{s}}i{\'c}

Automatic Term Extraction (ATE) extracts terminology from domain-specific corpora.

Term Extraction Translation

Paper
Add Code

Preemptive Toxic Language Detection in Wikipedia Comments Using Thread-Level Context

no code implementations • WS 2019 • Mladen Karan, Jan {\v{S}}najder

We address the task of automatically detecting toxic content in user generated texts.

Paper
Add Code

TakeLab at SemEval-2019 Task 4: Hyperpartisan News Detection

no code implementations • SEMEVAL 2019 • Niko Pali{\'c}, Juraj Vladika, Dominik {\v{C}}ubeli{\'c}, Ivan Lovren{\v{c}}i{\'c}, Maja Buljan, Jan {\v{S}}najder

In this paper, we demonstrate the system built to solve the SemEval-2019 task 4: Hyperpartisan News Detection (Kiesel et al., 2019), the task of automatically determining whether an article is heavily biased towards one side of the political spectrum.

Paper
Add Code

Cross-Domain Detection of Abusive Language Online

no code implementations • WS 2018 • Mladen Karan, Jan {\v{S}}najder

We investigate to what extent the models trained to detect general abusive language generalize between different datasets labeled with different abusive language types.

Abusive Language Domain Adaptation +1

Paper
Add Code

Not Just Depressed: Bipolar Disorder Prediction on Reddit

no code implementations • WS 2018 • Ivan Sekulic, Matej Gjurkovi{\'c}, Jan {\v{S}}najder

Bipolar disorder, an illness characterized by manic and depressive episodes, affects more than 60 million people worldwide.

Paper
Add Code

Combining Shallow and Deep Learning for Aggressive Text Detection

no code implementations • COLING 2018 • Viktor Golem, Mladen Karan, Jan {\v{S}}najder

The task, however, is far from being trivial, as what is considered as aggressive speech can be quite subjective, and the task is further complicated by the noisy nature of user-generated text on social networks.

BIG-bench Machine Learning Text Detection

Paper
Add Code

Reddit: A Gold Mine for Personality Prediction

no code implementations • WS 2018 • Matej Gjurkovi{\'c}, Jan {\v{S}}najder

Automated personality prediction from social media is gaining increasing attention in natural language processing and social sciences communities.

Type prediction

Paper
Add Code

Lexical Substitution for Evaluating Compositional Distributional Models

no code implementations • NAACL 2018 • Maja Buljan, Sebastian Pad{\'o}, Jan {\v{S}}najder

LexSub is a more natural task, enables us to evaluate meaning composition at the level of individual words, and provides a common ground to compare CDSMs with dedicated LexSub models.

Natural Language Inference Sentence +2

Paper
Add Code

TakeLab at SemEval-2018 Task12: Argument Reasoning Comprehension with Skip-Thought Vectors

no code implementations • SEMEVAL 2018 • Ana Brassard, Tin Kuculo, Filip Boltu{\v{z}}i{\'c}, Jan {\v{S}}najder

This paper describes our system for the SemEval-2018 Task 12: Argument Reasoning Comprehension Task.

Common Sense Reasoning Natural Language Inference +2

Paper
Add Code

TakeLab at SemEval-2018 Task 7: Combining Sparse and Dense Features for Relation Classification in Scientific Texts

no code implementations • SEMEVAL 2018 • Martin Gluhak, Maria Pia di Buono, Abbas Akkasi, Jan {\v{S}}najder

We describe two systems for semantic relation classification with which we participated in the SemEval 2018 Task 7, subtask 1 on semantic relation classification: an SVM model and a CNN model.

General Classification Relation +1

Paper
Add Code

Using Analytic Scoring Rubrics in the Automatic Assessment of College-Level Summary Writing Tasks in L2

no code implementations • IJCNLP 2017 • Tamara Sladoljev-Agejev, Jan {\v{S}}najder

Assessing summaries is a demanding, yet useful task which provides valuable information on language competence, especially for second language learners.

Reading Comprehension regression

Paper
Add Code

Predicting News Values from Headline Text and Emotions

no code implementations • WS 2017 • Maria Pia di Buono, Jan {\v{S}}najder, Bojana Dalbelo Ba{\v{s}}i{\'c}, Goran Glava{\v{s}}, Martin Tutek, Natasa Milic-Frayling

We present a preliminary study on predicting news values from headline text and emotions.

BIG-bench Machine Learning Recommendation Systems

Paper
Add Code

Toward Stance Classification Based on Claim Microstructures

no code implementations • WS 2017 • Filip Boltu{\v{z}}i{\'c}, Jan {\v{S}}najder

Claims are the building blocks of arguments and the reasons underpinning opinions, thus analyzing claims is important for both argumentation mining and opinion mining.

Argument Mining Classification +4

Paper
Add Code

Does Free Word Order Hurt? Assessing the Practical Lexical Function Model for Croatian

no code implementations • SEMEVAL 2017 • Zoran Medi{\'c}, Jan {\v{S}}najder, Sebastian Pad{\'o}

The Practical Lexical Function (PLF) model is a model of computational distributional semantics that attempts to strike a balance between expressivity and learnability in predicting phrase meaning and shows competitive results.

Semantic Textual Similarity

Paper
Add Code

TakeLab at SemEval-2017 Task 4: Recent Deaths and the Power of Nostalgia in Sentiment Analysis in Twitter

no code implementations • SEMEVAL 2017 • David Lozi{\'c}, Doria {\v{S}}ari{\'c}, Ivan Toki{\'c}, Zoran Medi{\'c}, Jan {\v{S}}najder

This paper describes the system we submitted to SemEval-2017 Task 4 (Sentiment Analysis in Twitter), specifically subtasks A, B, and D. Our main focus was topic-based message polarity classification on a two-point scale (subtask B).

Sentiment Analysis Word Embeddings

Paper
Add Code

TakeLab-QA at SemEval-2017 Task 3: Classification Experiments for Answer Retrieval in Community QA

no code implementations • SEMEVAL 2017 • Filip {\v{S}}aina, Toni Kukurin, Lukrecija Pulji{\'c}, Mladen Karan, Jan {\v{S}}najder

We use features based on different semantic similarity models (e. g., Latent Dirichlet Allocation), as well as features based on several types of pre-trained word embeddings.

Community Question Answering General Classification +6

Paper
Add Code

TakeLab at SemEval-2017 Task 5: Linear aggregation of word embeddings for fine-grained sentiment analysis of financial news

no code implementations • SEMEVAL 2017 • Leon Rotim, Martin Tutek, Jan {\v{S}}najder

Our best-performing submission scored 3rd on the task out of 29 teams and 4th out of 45 submissions with a cosine score of 0. 733.

Feature Engineering regression +2

Paper
Add Code

TakeLab at SemEval-2017 Task 6: \#RankingHumorIn4Pages

no code implementations • SEMEVAL 2017 • Marin Kukova{\v{c}}ec, Juraj Malenica, Ivan Mr{\v{s}}i{\'c}, Antonio {\v{S}}ajatovi{\'c}, Domagoj Alagi{\'c}, Jan {\v{S}}najder

This paper describes our system for humor ranking in tweets within the SemEval 2017 Task 6: {\#}HashtagWars (6A and 6B).

Common Sense Reasoning Humor Detection

Paper
Add Code

Comparison of Short-Text Sentiment Analysis Methods for Croatian

no code implementations • WS 2017 • Leon Rotim, Jan {\v{S}}najder

We focus on the task of supervised sentiment classification of short and informal texts in Croatian, using two simple yet effective methods: word embeddings and string kernels.

General Classification Sentiment Analysis +4

Paper
Add Code

Two Layers of Annotation for Representing Event Mentions in News Stories

no code implementations • WS 2017 • Maria Pia di Buono, Martin Tutek, Jan {\v{S}}najder, Goran Glava{\v{s}}, Bojana Dalbelo Ba{\v{s}}i{\'c}, Nata{\v{s}}a Mili{\'c}-Frayling

In this paper, we describe our preliminary study on annotating event mention as a part of our research on high-precision news event extraction models.

Event Extraction Vocal Bursts Valence Prediction

Paper
Add Code

The First Cross-Lingual Challenge on Recognition, Normalization, and Matching of Named Entities in Slavic Languages

no code implementations • WS 2017 • Jakub Piskorski, Lidia Pivovarova, Jan {\v{S}}najder, Josef Steinberger, Roman Yangarber

The reported evaluation figures reflect the relatively higher level of complexity of named entity-related tasks in the context of processing texts in Slavic languages.

Entity Linking Lemmatization +3

Paper
Add Code

Debunking Sentiment Lexicons: A Case of Domain-Specific Sentiment Classification for Croatian

no code implementations • WS 2017 • Paula Gombar, Zoran Medi{\'c}, Domagoj Alagi{\'c}, Jan {\v{S}}najder

We experiment with the graph-based acquisition of sentiment lexicons, analyze their quality, and investigate how effectively they can be used in sentiment classification.

Classification General Classification +4

Paper
Add Code

A Preliminary Study of Croatian Lexical Substitution

no code implementations • WS 2017 • Domagoj Alagi{\'c}, Jan {\v{S}}najder

Lexical substitution is a task of determining a meaning-preserving replacement for a word in context.

Information Retrieval Machine Translation +1

Paper
Add Code

Combining Linguistic Features for the Detection of Croatian Multiword Expressions

no code implementations • WS 2017 • Maja Buljan, Jan {\v{S}}najder

As multiword expressions (MWEs) exhibit a range of idiosyncrasies, their automatic detection warrants the use of many different features.

Paper
Add Code

Unsupervised Acquisition of Comprehensive Multiword Lexicons using Competition in an n-gram Lattice

no code implementations • TACL 2017 • Julian Brooke, Jan {\v{S}}najder, Timothy Baldwin

We present a new model for acquiring comprehensive multiword lexicons from large corpora based on competition among n-gram candidates.

Paper
Add Code

Predictability of Distributional Semantics in Derivational Word Formation

no code implementations • COLING 2016 • Sebastian Pad{\'o}, Aur{\'e}lie Herbelot, Max Kisselew, Jan {\v{S}}najder

Compositional distributional semantic models (CDSMs) have successfully been applied to the task of predicting the meaning of a range of linguistic constructions.

Machine Translation regression +2

Paper
Add Code

Fill the Gap! Analyzing Implicit Premises between Claims from Online Debates

no code implementations • WS 2016 • Filip Boltu{\v{z}}i{\'c}, Jan {\v{S}}najder

Argument Mining Opinion Mining

Paper
Add Code

Analysis of Policy Agendas: Lessons Learned from Automatic Topic Classification of Croatian Political Texts

no code implementations • WS 2016 • Mladen Karan, Jan {\v{S}}najder, Daniela {\v{S}}irini{\'c}, Goran Glava{\v{s}}

Decision Making General Classification +1

Paper
Add Code

TakeLab at SemEval-2016 Task 6: Stance Classification in Tweets Using a Genetic Algorithm Based Ensemble

no code implementations • SEMEVAL 2016 • Martin Tutek, Ivan Sekuli{\'c}, Paula Gombar, Ivan Paljak, Filip {\v{C}}ulinovi{\'c}, Filip Boltu{\v{z}}i{\'c}, Mladen Karan, Domagoj Alagi{\'c}, Jan {\v{S}}najder

General Classification Stance Classification +1

Paper
Add Code

VerbCROcean: A Repository of Fine-Grained Semantic Verb Relations for Croatian

no code implementations • LREC 2016 • Ivan Sekuli{\'c}, Jan {\v{S}}najder

In this paper we describe VerbCROcean, a broad-coverage repository of fine-grained semantic relations between Croatian verbs.

Relation

Paper
Add Code

Cro36WSD: A Lexical Sample for Croatian Word Sense Disambiguation

no code implementations • LREC 2016 • Domagoj Alagi{\'c}, Jan {\v{S}}najder

We introduce Cro36WSD, a freely-available medium-sized lexical sample for Croatian word sense disambiguation (WSD). Cro36WSD comprises 36 words: 12 adjectives, 12 nouns, and 12 verbs, balanced across both frequency bands and polysemy levels.

Word Sense Disambiguation