no code implementations • DCLRL (LREC) 2022 • Erik Körner, Felix Helfer, Christopher Schröder, Thomas Eckart, Dirk Goldhahn
The “Web as corpus” paradigm opens opportunities for enhancing the current state of language resources for endangered and under-resourced languages.
no code implementations • 12 Mar 2025 • Julia Romberg, Christopher Schröder, Julius Gonsior, Katrin Tomanek, Fredrik Olsson
Supervised learning relies on annotated data, which is expensive to obtain.
1 code implementation • 13 Jun 2024 • Christopher Schröder, Gerhard Heyer
Active learning is an iterative labeling process that is used to obtain a small labeled subset, despite the absence of labeled data, thereby enabling to train a model for supervised tasks such as text classification.
no code implementations • 14 Dec 2022 • Niklas Deckers, Maik Fröbe, Johannes Kiesel, Gianluca Pandolfo, Christopher Schröder, Benno Stein, Martin Potthast
Conditional generative models such as DALL-E and Stable Diffusion generate images based on a user-defined text, the prompt.
no code implementations • 9 Sep 2022 • Magdalena Wolska, Christopher Schröder, Ole Borchardt, Benno Stein, Martin Potthast
We present the first dataset and evaluation results on a newly defined computational task of trigger warning assignment.
no code implementations • 21 Nov 2021 • Maik Fröbe, Matthias Hagen, Janek Bevendorff, Michael Völske, Benno Stein, Christopher Schröder, Robby Wagner, Lukas Gienapp, Martin Potthast
Commercial web search engines employ near-duplicate detection to ensure that users see each relevant result only once, albeit the underlying web crawls typically include (near-)duplicates of many web pages.
1 code implementation • European Chapter of the Association for Computational Linguistics 2023 • Christopher Schröder, Lydia Müller, Andreas Niekler, Martin Potthast
We introduce small-text, an easy-to-use active learning library, which offers pool-based active learning for single- and multi-label text classification in Python.
2 code implementations • Findings (ACL) 2022 • Christopher Schröder, Andreas Niekler, Martin Potthast
Active learning is the iterative construction of a classification model through targeted labeling, enabling significant labeling cost savings.
no code implementations • ACL 2021 • Christopher Schröder, Kim Bürgl, Yves Annanias, Andreas Niekler, Lydia Müller, Daniel Wiegreffe, Christian Bender, Christoph Mengs, Gerik Scheuermann, Gerhard Heyer
In total, we process nine categories and actively learn their representation in our dataset.
no code implementations • 17 Aug 2020 • Christopher Schröder, Andreas Niekler
We review AL for text classification using deep neural networks (DNNs) and elaborate on two main causes which used to hinder the adoption: (a) the inability of NNs to provide reliable uncertainty estimates, on which the most commonly used query strategies rely, and (b) the challenge of training DNNs on small data.