no code implementations • Findings (EMNLP) 2021 • Antonis Maronikolakis, Philipp Dufter, Hinrich Schütze
The size of the vocabulary is a central design choice in large pretrained language models, with respect to both performance and memory requirements.
1 code implementation • EACL (AdaptNLP) 2021 • Antonis Maronikolakis, Hinrich Schütze
Thus, instead of training multiple models, we can train a single multidomain model saving on computational resources and training time.
1 code implementation • NAACL (WOAH) 2022 • Shuzhou Yuan, Antonis Maronikolakis, Hinrich Schütze
Research to tackle hate speech plaguing online media has made strides in providing solutions, analyzing bias and curating data.
1 code implementation • 16 Jun 2023 • Victor Steinborn, Antonis Maronikolakis, Hinrich Schütze
Non-English bias research, however, is still in its infancy with most work focusing on English.
no code implementations • 4 Apr 2023 • Antonis Maronikolakis, Abdullatif Köksal, Hinrich Schütze
We introduce HATELEXICON, a lexicon of slurs and targets of hate speech for the countries of Brazil, Germany, India and Kenya, to aid training and interpretability of models.
no code implementations • 25 Oct 2022 • Junze Li, Mengjie Zhao, Yubo Xie, Antonis Maronikolakis, Pearl Pu, Hinrich Schütze
Humor is a magnetic component in everyday human interactions and communications.
no code implementations • NAACL (GeBNLP) 2022 • Antonis Maronikolakis, Philip Baader, Hinrich Schütze
To tackle the rising phenomenon of hate speech, efforts have been made towards data curation and analysis.
1 code implementation • Findings (ACL) 2022 • Antonis Maronikolakis, Axel Wisiorek, Leah Nann, Haris Jabbar, Sahana Udupa, Hinrich Schuetze
Building on current work on multilingual hate speech (e. g., Ousidhoum et al. (2019)) and hate speech reduction (e. g., Sap et al. (2020)), we present XTREMESPEECH, a new hate speech dataset containing 20, 297 social media passages from Brazil, Germany, India and Kenya.
no code implementations • EMNLP (insights) 2021 • Antonis Maronikolakis, Philipp Dufter, Hinrich Schütze
We show that the closer two languages are, the better BERT can align them on the character level.
no code implementations • 13 Sep 2021 • Antonis Maronikolakis, Philipp Dufter, Hinrich Schütze
The size of the vocabulary is a central design choice in large pretrained language models, with respect to both performance and memory requirements.
no code implementations • NAACL (NLP4IF) 2021 • Antonis Maronikolakis, Hinrich Schutze, Mark Stevenson
False information spread via the internet and social media influences public opinion and user activity, while generative models enable fake content to be generated faster and more cheaply than had previously been possible.
no code implementations • ACL 2020 • Antonis Maronikolakis, Danae Sanchez Villegas, Daniel Preotiuc-Pietro, Nikolaos Aletras
Parody is a figurative device used to imitate an entity for comedic or critical purposes and represents a widespread phenomenon in social media through many popular parody accounts.