Search Results for author: Antonis Maronikolakis

Found 12 papers, 4 papers with code

Wine is not v i n. On the Compatibility of Tokenizations across Languages

no code implementations Findings (EMNLP) 2021 Antonis Maronikolakis, Philipp Dufter, Hinrich Schütze

The size of the vocabulary is a central design choice in large pretrained language models, with respect to both performance and memory requirements.

Multidomain Pretrained Language Models for Green NLP

1 code implementation EACL (AdaptNLP) 2021 Antonis Maronikolakis, Hinrich Schütze

Thus, instead of training multiple models, we can train a single multidomain model saving on computational resources and training time.

Domain Adaptation

Separating Hate Speech and Offensive Language Classes via Adversarial Debiasing

1 code implementation NAACL (WOAH) 2022 Shuzhou Yuan, Antonis Maronikolakis, Hinrich Schütze

Research to tackle hate speech plaguing online media has made strides in providing solutions, analyzing bias and curating data.

Sociocultural knowledge is needed for selection of shots in hate speech detection tasks

no code implementations4 Apr 2023 Antonis Maronikolakis, Abdullatif Köksal, Hinrich Schütze

We introduce HATELEXICON, a lexicon of slurs and targets of hate speech for the countries of Brazil, Germany, India and Kenya, to aid training and interpretability of models.

Few-Shot Learning Hate Speech Detection

Analyzing Hate Speech Data along Racial, Gender and Intersectional Axes

no code implementations NAACL (GeBNLP) 2022 Antonis Maronikolakis, Philip Baader, Hinrich Schütze

To tackle the rising phenomenon of hate speech, efforts have been made towards data curation and analysis.

Listening to Affected Communities to Define Extreme Speech: Dataset and Experiments

1 code implementation Findings (ACL) 2022 Antonis Maronikolakis, Axel Wisiorek, Leah Nann, Haris Jabbar, Sahana Udupa, Hinrich Schuetze

Building on current work on multilingual hate speech (e. g., Ousidhoum et al. (2019)) and hate speech reduction (e. g., Sap et al. (2020)), we present XTREMESPEECH, a new hate speech dataset containing 20, 297 social media passages from Brazil, Germany, India and Kenya.

BERT Cannot Align Characters

no code implementations EMNLP (insights) 2021 Antonis Maronikolakis, Philipp Dufter, Hinrich Schütze

We show that the closer two languages are, the better BERT can align them on the character level.

Wine is Not v i n. -- On the Compatibility of Tokenizations Across Languages

no code implementations13 Sep 2021 Antonis Maronikolakis, Philipp Dufter, Hinrich Schütze

The size of the vocabulary is a central design choice in large pretrained language models, with respect to both performance and memory requirements.

Identifying Automatically Generated Headlines using Transformers

no code implementations NAACL (NLP4IF) 2021 Antonis Maronikolakis, Hinrich Schutze, Mark Stevenson

False information spread via the internet and social media influences public opinion and user activity, while generative models enable fake content to be generated faster and more cheaply than had previously been possible.

Misinformation

Analyzing Political Parody in Social Media

no code implementations ACL 2020 Antonis Maronikolakis, Danae Sanchez Villegas, Daniel Preotiuc-Pietro, Nikolaos Aletras

Parody is a figurative device used to imitate an entity for comedic or critical purposes and represents a widespread phenomenon in social media through many popular parody accounts.

Fact Checking Sentiment Analysis

Cannot find the paper you are looking for? You can Submit a new open access paper.