Search Results for author: Gregor Wiedemann

Found 21 papers, 6 papers with code

A Generalized Approach to Protest Event Detection in German Local News

no code implementations LREC 2022 Gregor Wiedemann, Jan Matti Dollbaum, Sebastian Haunss, Priska Daphi, Larissa Daria Meier

However, in a second experiment, we show that our model does not generalize equally well when applied to data from time periods and localities other than our training sample.

Event Detection Management

Few-Shot Learning for Argument Aspects of the Nuclear Energy Debate

1 code implementation LREC 2022 Lena Jurkschat, Gregor Wiedemann, Maximilian Heinrich, Mattes Ruckdeschel, Sunna Torge

We approach aspect-based argument mining as a supervised machine learning task to classify arguments into semantically coherent groups referring to the same defined aspect categories.

Argument Mining Few-Shot Learning

Application of the interactive Leipzig Corpus Miner as a generic research platform for the use in the social sciences

no code implementations6 Oct 2021 Christian Kahmann, Andreas Niekler, Gregor Wiedemann

This article introduces to the interactive Leipzig Corpus Miner (iLCM) - a newly released, open-source software to perform automatic content analysis.

Forum 4.0: An Open-Source User Comment Analysis Framework

no code implementations EACL 2021 Marlo Haering, Jakob Smedegaard Andersen, Chris Biemann, Wiebke Loosen, Benjamin Milde, Tim Pietz, Christian St{\"o}cker, Gregor Wiedemann, Olaf Zukunft, Walid Maalej

With the increasing number of user comments in diverse domains, including comments on online journalism and e-commerce websites, the manual content analysis of these comments becomes time-consuming and challenging.

BIG-bench Machine Learning

Transfer Learning from LDA to BiLSTM-CNN for Offensive Language Detection in Twitter

no code implementations7 Nov 2018 Gregor Wiedemann, Eugen Ruppert, Raghav Jindal, Chris Biemann

Best results are achieved from pre-training our model on the unsupervised topic clustering of tweets in combination with thematic user cluster information.

Clustering General Classification +1

microNER: A Micro-Service for German Named Entity Recognition based on BiLSTM-CRF

no code implementations7 Nov 2018 Gregor Wiedemann, Raghav Jindal, Chris Biemann

We evaluate the performance of different word and character embeddings on two standard German datasets and with a special focus on out-of-vocabulary words.

named-entity-recognition Named Entity Recognition +2

A Multilingual Information Extraction Pipeline for Investigative Journalism

no code implementations EMNLP 2018 Gregor Wiedemann, Seid Muhie Yimam, Chris Biemann

We introduce an advanced information extraction pipeline to automatically process very large collections of unstructured textual data for the purpose of investigative journalism.

Entity Extraction using GAN

New/s/leak 2.0 - Multilingual Information Extraction and Visualization for Investigative Journalism

no code implementations13 Jul 2018 Gregor Wiedemann, Seid Muhie Yimam, Chris Biemann

Investigative journalism in recent years is confronted with two major challenges: 1) vast amounts of unstructured data originating from large text collections such as leaks or answers to Freedom of Information requests, and 2) multi-lingual data due to intensified global cooperation and communication in politics, business and civil society.

Efficient Exploration

iLCM - A Virtual Research Infrastructure for Large-Scale Qualitative Data

no code implementations LREC 2018 Andreas Niekler, Arnim Bleier, Christian Kahmann, Lisa Posch, Gregor Wiedemann, Kenan Erdogan, Gerhard Heyer, Markus Strohmaier

The iLCM project pursues the development of an integrated research environment for the analysis of structured and unstructured data in a "Software as a Service" architecture (SaaS).

Page Stream Segmentation with Convolutional Neural Nets Combining Textual and Visual Features

no code implementations LREC 2018 Gregor Wiedemann, Gerhard Heyer

In recent years, (retro-)digitizing paper-based files became a major undertaking for private and public archives as well as an important task in electronic mailroom applications.

Optical Character Recognition Optical Character Recognition (OCR) +1

Leipzig Corpus Miner - A Text Mining Infrastructure for Qualitative Data Analysis

no code implementations11 Jul 2017 Andreas Niekler, Gregor Wiedemann, Gerhard Heyer

This paper presents the "Leipzig Corpus Miner", a technical infrastructure for supporting qualitative and quantitative content analysis.

Information Retrieval Retrieval

Modeling the dynamics of domain specific terminology in diachronic corpora

no code implementations11 Jul 2017 Gerhard Heyer, Cathleen Kantner, Andreas Niekler, Max Overbeck, Gregor Wiedemann

In terminology work, natural language processing, and digital humanities, several studies address the analysis of variations in context and meaning of terms in order to detect semantic change and the evolution of terms.

Term Extraction

Cannot find the paper you are looking for? You can Submit a new open access paper.