no code implementations • EMNLP (ACL) 2021 • Tariq Yousef, Antje Schlaf, Janos Borst, Andreas Niekler, Gerhard Heyer
Freedom of the press and media is of vital importance for democratically organised states and open societies.
no code implementations • 3 May 2023 • Fabian Ziegner, Janos Borst, Andreas Niekler, Martin Potthast
This paper evaluates the viability of using fixed language models for training text classification networks on low-end hardware.
no code implementations • 30 Nov 2022 • Janos Borst, Thomas Wencker, Andreas Niekler
To estimate realistic rates of overreporting in large data sets over times, we propose an approach based on state-of-the-art text classification.
no code implementations • 6 Oct 2021 • Christian Kahmann, Andreas Niekler, Gregor Wiedemann
This article introduces to the interactive Leipzig Corpus Miner (iLCM) - a newly released, open-source software to perform automatic content analysis.
1 code implementation • European Chapter of the Association for Computational Linguistics 2023 • Christopher Schröder, Lydia Müller, Andreas Niekler, Martin Potthast
We introduce small-text, an easy-to-use active learning library, which offers pool-based active learning for single- and multi-label text classification in Python.
2 code implementations • Findings (ACL) 2022 • Christopher Schröder, Andreas Niekler, Martin Potthast
Active learning is the iterative construction of a classification model through targeted labeling, enabling significant labeling cost savings.
no code implementations • ACL 2021 • Christopher Schröder, Kim Bürgl, Yves Annanias, Andreas Niekler, Lydia Müller, Daniel Wiegreffe, Christian Bender, Christoph Mengs, Gerik Scheuermann, Gerhard Heyer
In total, we process nine categories and actively learn their representation in our dataset.
no code implementations • 17 Aug 2020 • Christopher Schröder, Andreas Niekler
We review AL for text classification using deep neural networks (DNNs) and elaborate on two main causes which used to hinder the adoption: (a) the inability of NNs to provide reliable uncertainty estimates, on which the most commonly used query strategies rely, and (b) the challenge of training DNNs on small data.
no code implementations • LREC 2018 • Andreas Niekler, Arnim Bleier, Christian Kahmann, Lisa Posch, Gregor Wiedemann, Kenan Erdogan, Gerhard Heyer, Markus Strohmaier
The iLCM project pursues the development of an integrated research environment for the analysis of structured and unstructured data in a "Software as a Service" architecture (SaaS).
no code implementations • 15 Nov 2017 • Christian Kahmann, Andreas Niekler, Gerhard Heyer
The new measure of context volatility that we propose models the degree by which terms change context in a text collection over time.
no code implementations • 11 Jul 2017 • Gerhard Heyer, Cathleen Kantner, Andreas Niekler, Max Overbeck, Gregor Wiedemann
In terminology work, natural language processing, and digital humanities, several studies address the analysis of variations in context and meaning of terms in order to detect semantic change and the evolution of terms.
no code implementations • 11 Jul 2017 • Andreas Niekler, Gregor Wiedemann, Gerhard Heyer
This paper presents the "Leipzig Corpus Miner", a technical infrastructure for supporting qualitative and quantitative content analysis.
no code implementations • LREC 2014 • Christian Haenig, Andreas Niekler, Carsten Wuensch
In this paper, we describe a publicly available multilingual evaluation corpus for phrase-level Sentiment Analysis that can be used to evaluate real world applications in an industrial context.