Search Results for author: Guy De Pauw

Found 11 papers, 2 papers with code

4chan & 8chan embeddings

no code implementations • 2 Apr 2020 • Pierre Voué, Tom De Smedt, Guy De Pauw

We have collected over 30M messages from the publicly available /pol/ message boards on 4chan and 8chan, and compiled them into a model of toxic language use.

Hate Speech Detection Word Embeddings

Current Limitations in Cyberbullying Detection: on Evaluation Criteria, Reproducibility, and Data Scarcity

1 code implementation • 25 Oct 2019 • Chris Emmery, Ben Verhoeven, Guy De Pauw, Gilles Jacobs, Cynthia Van Hee, Els Lefever, Bart Desmet, Véronique Hoste, Walter Daelemans

The detection of online cyberbullying has seen an increase in societal importance, popularity in research, and available open data.

Domain Generalization

A weakly supervised sequence tagging and grammar induction approach to semantic frame slot filling

no code implementations • 15 Jun 2019 • Janneke van de Loo, Guy De Pauw, Walter Daelemans

This paper describes continuing work on semantic frame slot filling for a command and control task using a weakly-supervised approach.

Slot Filling

Effective weakly supervised semantic frame induction using expression sharing in hierarchical hidden Markov models

1 code implementation • 30 Jan 2019 • Janneke van de Loo, Jort F. Gemmeke, Guy De Pauw, Bart Ons, Walter Daelemans, Hugo Van hamme

We present a framework for the induction of semantic frames from utterances in the context of an adaptive command-and-control interface.

Weakly Supervised Classification

Multilingual Cross-domain Perspectives on Online Hate Speech

no code implementations • 11 Sep 2018 • Tom De Smedt, Sylvia Jaki, Eduan Kotzé, Leïla Saoud, Maja Gwóźdź, Guy De Pauw, Walter Daelemans

In this report, we present a study of eight corpora of online hate speech, by demonstrating the NLP techniques that we used to collect and analyze the jihadist, extremist, racist, and sexist content.

General Classification Text Classification

Automatic Detection of Online Jihadist Hate Speech

no code implementations • 13 Mar 2018 • Tom De Smedt, Guy De Pauw, Pieter Van Ostaeyen

We have developed a system that automatically detects online jihadist hate speech with over 80% accuracy, by using techniques from Natural Language Processing and Machine Learning.

BIG-bench Machine Learning

Automatic Detection of Cyberbullying in Social Media Text

no code implementations • 17 Jan 2018 • Cynthia Van Hee, Gilles Jacobs, Chris Emmery, Bart Desmet, Els Lefever, Ben Verhoeven, Guy De Pauw, Walter Daelemans, Véronique Hoste

While social media offer great communication opportunities, they also increase the vulnerability of young people to threatening situations online.

Binary Classification

Detection and Fine-Grained Classification of Cyberbullying Events

no code implementations • RANLP 2015 • Cynthia Van Hee, Els Lefever, Ben Verhoeven, Julie Mennes, Bart Desmet, Guy De Pauw, Walter Daelemans, Véronique Hoste

Classification General Classification

A Self Learning Vocal Interface for Speech-impaired Users

no code implementations • WS 2013 • Bart Ons, Netsanet Tessema, Janneke van de Loo, Jort Gemmeke, Guy De Pauw, Walter Daelemans, Hugo Van hamme

Self-Learning Speech Recognition

Towards a Self-Learning Assistive Vocal Interface: Vocabulary and Grammar Learning

no code implementations • WS 2012 • Janneke van de Loo, Jort F. Gemmeke, Guy De Pauw, Joris Driesen, Hugo Van hamme, Walter Daelemans

Self-Learning Speech Recognition

The Netlog Corpus. A Resource for the Study of Flemish Dutch Internet Language

no code implementations • LREC 2012 • Mike Kestemont, Claudia Peersman, Benny De Decker, Guy De Pauw, Kim Luyckx, Roser Morante, Frederik Vaassen, Janneke van de Loo, Walter Daelemans

Although in recent years numerous forms of Internet communication (such as e-mail, blogs, chat rooms and social network environments) have emerged, balanced corpora of Internet speech with trustworthy meta-information (e.g. age and gender) or linguistic annotations are still limited.

Lemmatization POS
