1 code implementation • LREC 2022 • Lena Jurkschat, Gregor Wiedemann, Maximilian Heinrich, Mattes Ruckdeschel, Sunna Torge
We approach aspect-based argument mining as a supervised machine learning task to classify arguments into semantically coherent groups referring to the same defined aspect categories.
no code implementations • LREC 2022 • Gregor Wiedemann, Jan Matti Dollbaum, Sebastian Haunss, Priska Daphi, Larissa Daria Meier
However, in a second experiment, we show that our model does not generalize equally well when applied to data from time periods and localities other than our training sample.
1 code implementation • EMNLP 2021 • Erik Körner, Gregor Wiedemann, Ahmad Dawar Hakimi, Gerhard Heyer, Martin Potthast
To ease the difficulty of argument stance classification, the task of same side stance classification (S3C) has been proposed.
1 code implementation • ArgMining (ACL) 2022 • Mattes Ruckdeschel, Gregor Wiedemann
This dataset enables both the supervised learning of boundaries and categorization of argument aspects.
no code implementations • 6 Dec 2024 • Jonas Rieger, Mattes Ruckdeschel, Gregor Wiedemann
Few-shot learning and parameter-efficient fine-tuning (PEFT) are crucial to overcome the challenges of data scarcity and ever growing language model sizes.
no code implementations • 28 Dec 2023 • Jonas Rieger, Kostiantyn Yanchenko, Mattes Ruckdeschel, Gerret von Nordheim, Katharina Kleinen-von Königslöw, Gregor Wiedemann
In this study, we address these challenges by using a multilingual transformer model in combination with the adapter extension to transformers, and few-shot learning methods.
2 code implementations • 9 Oct 2022 • Aida Kostikova, Benjamin Paassen, Dominik Beese, Ole Pütz, Gregor Wiedemann, Steffen Eger
Solidarity is a crucial concept to understand social relations in societies.
no code implementations • 6 Oct 2021 • Christian Kahmann, Andreas Niekler, Gregor Wiedemann
This article introduces to the interactive Leipzig Corpus Miner (iLCM) - a newly released, open-source software to perform automatic content analysis.
no code implementations • EACL 2021 • Marlo Haering, Jakob Smedegaard Andersen, Chris Biemann, Wiebke Loosen, Benjamin Milde, Tim Pietz, Christian St{\"o}cker, Gregor Wiedemann, Olaf Zukunft, Walid Maalej
With the increasing number of user comments in diverse domains, including comments on online journalism and e-commerce websites, the manual content analysis of these comments becomes time-consuming and challenging.
no code implementations • SEMEVAL 2020 • Gregor Wiedemann, Seid Muhie Yimam, Chris Biemann
Fine-tuning of pre-trained transformer networks such as BERT yield state-of-the-art results for text classification tasks.
1 code implementation • Lang Resources & Evaluation 2019 • Gregor Wiedemann, Gerhard Heyer
In the case of multi-page documents, the preservation of document contexts is a major requirement.
Optical Character Recognition Optical Character Recognition (OCR) +2
1 code implementation • 23 Sep 2019 • Gregor Wiedemann, Steffen Remus, Avi Chawla, Chris Biemann
Since vectors of the same word type can vary depending on the respective context, they implicitly provide a model for word sense disambiguation (WSD).
1 code implementation • ACL 2019 • Max Friedrich, Arne Köhn, Gregor Wiedemann, Chris Biemann
De-identification is the task of detecting protected health information (PHI) in medical text.
no code implementations • SEMEVAL 2019 • Gregor Wiedemann, Eugen Ruppert, Chris Biemann
We present a neural network based approach of transfer learning for offensive language detection.
no code implementations • 7 Nov 2018 • Gregor Wiedemann, Raghav Jindal, Chris Biemann
We evaluate the performance of different word and character embeddings on two standard German datasets and with a special focus on out-of-vocabulary words.
no code implementations • 7 Nov 2018 • Gregor Wiedemann, Eugen Ruppert, Raghav Jindal, Chris Biemann
Best results are achieved from pre-training our model on the unsupervised topic clustering of tweets in combination with thematic user cluster information.
no code implementations • EMNLP 2018 • Gregor Wiedemann, Seid Muhie Yimam, Chris Biemann
We introduce an advanced information extraction pipeline to automatically process very large collections of unstructured textual data for the purpose of investigative journalism.
no code implementations • 13 Jul 2018 • Gregor Wiedemann, Seid Muhie Yimam, Chris Biemann
Investigative journalism in recent years is confronted with two major challenges: 1) vast amounts of unstructured data originating from large text collections such as leaks or answers to Freedom of Information requests, and 2) multi-lingual data due to intensified global cooperation and communication in politics, business and civil society.
no code implementations • LREC 2018 • Andreas Niekler, Arnim Bleier, Christian Kahmann, Lisa Posch, Gregor Wiedemann, Kenan Erdogan, Gerhard Heyer, Markus Strohmaier
The iLCM project pursues the development of an integrated research environment for the analysis of structured and unstructured data in a "Software as a Service" architecture (SaaS).
no code implementations • LREC 2018 • Gregor Wiedemann, Gerhard Heyer
In recent years, (retro-)digitizing paper-based files became a major undertaking for private and public archives as well as an important task in electronic mailroom applications.
Optical Character Recognition Optical Character Recognition (OCR) +1
no code implementations • 11 Jul 2017 • Gerhard Heyer, Cathleen Kantner, Andreas Niekler, Max Overbeck, Gregor Wiedemann
In terminology work, natural language processing, and digital humanities, several studies address the analysis of variations in context and meaning of terms in order to detect semantic change and the evolution of terms.
no code implementations • 11 Jul 2017 • Andreas Niekler, Gregor Wiedemann, Gerhard Heyer
This paper presents the "Leipzig Corpus Miner", a technical infrastructure for supporting qualitative and quantitative content analysis.