Search Results for author: Guy De Pauw

Found 11 papers, 2 papers with code

4chan & 8chan embeddings

no code implementations2 Apr 2020 Pierre Voué, Tom De Smedt, Guy De Pauw

We have collected over 30M messages from the publicly available /pol/ message boards on 4chan and 8chan, and compiled them into a model of toxic language use.

Hate Speech Detection Word Embeddings

A weakly supervised sequence tagging and grammar induction approach to semantic frame slot filling

no code implementations15 Jun 2019 Janneke van de Loo, Guy De Pauw, Walter Daelemans

This paper describes continuing work on semantic frame slot filling for a command and control task using a weakly-supervised approach.

slot-filling Slot Filling

Multilingual Cross-domain Perspectives on Online Hate Speech

no code implementations11 Sep 2018 Tom De Smedt, Sylvia Jaki, Eduan Kotzé, Leïla Saoud, Maja Gwóźdź, Guy De Pauw, Walter Daelemans

In this report, we present a study of eight corpora of online hate speech, by demonstrating the NLP techniques that we used to collect and analyze the jihadist, extremist, racist, and sexist content.

General Classification text-classification +1

Automatic Detection of Online Jihadist Hate Speech

no code implementations13 Mar 2018 Tom De Smedt, Guy De Pauw, Pieter Van Ostaeyen

We have developed a system that automatically detects online jihadist hate speech with over 80% accuracy, by using techniques from Natural Language Processing and Machine Learning.

BIG-bench Machine Learning

Automatic Detection of Cyberbullying in Social Media Text

no code implementations17 Jan 2018 Cynthia Van Hee, Gilles Jacobs, Chris Emmery, Bart Desmet, Els Lefever, Ben Verhoeven, Guy De Pauw, Walter Daelemans, Véronique Hoste

While social media offer great communication opportunities, they also increase the vulnerability of young people to threatening situations online.

Binary Classification

The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language

no code implementations LREC 2012 Mike Kestemont, Claudia Peersman, Benny De Decker, Guy De Pauw, Kim Luyckx, Roser Morante, Frederik Vaassen, Janneke van de Loo, Walter Daelemans

Although in recent years numerous forms of Internet communication ― such as e-mail, blogs, chat rooms and social network environments ― have emerged, balanced corpora of Internet speech with trustworthy meta-information (e. g. age and gender) or linguistic annotations are still limited.

Lemmatization POS +2

Cannot find the paper you are looking for? You can Submit a new open access paper.