no code implementations • 2 Apr 2020 • Pierre Voué, Tom De Smedt, Guy De Pauw
We have collected over 30M messages from the publicly available /pol/ message boards on 4chan and 8chan, and compiled them into a model of toxic language use.
1 code implementation • 25 Oct 2019 • Chris Emmery, Ben Verhoeven, Guy De Pauw, Gilles Jacobs, Cynthia Van Hee, Els Lefever, Bart Desmet, Véronique Hoste, Walter Daelemans
The detection of online cyberbullying has seen an increase in societal importance, popularity in research, and available open data.
no code implementations • 15 Jun 2019 • Janneke van de Loo, Guy De Pauw, Walter Daelemans
This paper describes continuing work on semantic frame slot filling for a command and control task using a weakly-supervised approach.
1 code implementation • 30 Jan 2019 • Janneke van de Loo, Jort F. Gemmeke, Guy De Pauw, Bart Ons, Walter Daelemans, Hugo Van hamme
We present a framework for the induction of semantic frames from utterances in the context of an adaptive command-and-control interface.
no code implementations • 11 Sep 2018 • Tom De Smedt, Sylvia Jaki, Eduan Kotzé, Leïla Saoud, Maja Gwóźdź, Guy De Pauw, Walter Daelemans
In this report, we present a study of eight corpora of online hate speech, by demonstrating the NLP techniques that we used to collect and analyze the jihadist, extremist, racist, and sexist content.
no code implementations • 13 Mar 2018 • Tom De Smedt, Guy De Pauw, Pieter Van Ostaeyen
We have developed a system that automatically detects online jihadist hate speech with over 80% accuracy, by using techniques from Natural Language Processing and Machine Learning.
no code implementations • 17 Jan 2018 • Cynthia Van Hee, Gilles Jacobs, Chris Emmery, Bart Desmet, Els Lefever, Ben Verhoeven, Guy De Pauw, Walter Daelemans, Véronique Hoste
While social media offer great communication opportunities, they also increase the vulnerability of young people to threatening situations online.
no code implementations • LREC 2012 • Mike Kestemont, Claudia Peersman, Benny De Decker, Guy De Pauw, Kim Luyckx, Roser Morante, Frederik Vaassen, Janneke van de Loo, Walter Daelemans
Although in recent years numerous forms of Internet communication ― such as e-mail, blogs, chat rooms and social network environments ― have emerged, balanced corpora of Internet speech with trustworthy meta-information (e. g. age and gender) or linguistic annotations are still limited.