1 code implementation • COLING (LaTeCHCLfL, CLFL, LaTeCH) 2020 • Thora Hagen, Erik Ketzan, Fotis Jannidis, Andreas Witt
This paper accompanies the corpus publication of EncycNet, a novel XML/TEI annotated corpus of 22 historical German encyclopedias from the early 18th to early 20th century.
no code implementations • 15 May 2014 • Laurent Romary, Andreas Witt
In recent years, new developments in the area of lexicography have altered not only the management, processing and publishing of lexicographical data, but also created new types of products such as electronic dictionaries and thesauri.
no code implementations • LREC 2014 • Piotr Ba{\'n}ski, Nils Diewald, Michael Hanl, Marc Kupietz, Andreas Witt
We present an approach to an aspect of managing complex access scenarios to large and heterogeneous corpora that involves handling user queries that, intentionally or due to the complexity of the queried resource, target texts or annotations outside of the given userÂ’s permissions.
no code implementations • LREC 2012 • Piotr Ba{\'n}ski, Peter M. Fischer, Elena Frick, Erik Ketzan, Marc Kupietz, Carsten Schnober, Oliver Schonefeld, Andreas Witt
The aim of this project is to develop an innovative corpus analysis platform to tackle the increasing demands of modern linguistic research.
no code implementations • LREC 2016 • Piotr Ba{\'n}ski, Elena Frick, Andreas Witt
The present paper describes Corpus Query Lingua Franca (ISO CQLF), a specification designed at ISO Technical Committee 37 Subcommittee 4 {``}Language resource management{''} for the purpose of facilitating the comparison of properties of corpus query languages.
no code implementations • LREC 2016 • Nils Diewald, Michael Hanl, Eliza Margaretha, Joachim Bingel, Marc Kupietz, Piotr Ba{\'n}ski, Andreas Witt
KorAP is a corpus search and analysis platform, developed at the Institute for the German Language (IDS).
no code implementations • LREC 2020 • Franciska de Jong, Bente Maegaard, Darja Fi{\v{s}}er, Dieter van Uytvanck, Andreas Witt
CLARIN is a European Research Infrastructure providing access to language resources and technologies for researchers in the humanities and social sciences.
no code implementations • LREC 2020 • Pawel Kamocki, Andreas Witt
The ambition of the proposed paper is to analyse the practical meaning of Privacy by Design in the context of Language Resources, and propose measures and safeguards that can be implemented by the community to ensure respect of this principle.
no code implementations • LREC 2020 • Rezvaneh Rezapour, Jutta Bopp, Norman Fiedler, Diana Steffen, Andreas Witt, Jana Diesner
This paper proposes, implements and evaluates a novel, corpus-based approach for identifying categories indicative of the impact of research via a deductive (top-down, from theory to data) and an inductive (bottom-up, from data to theory) approach.
no code implementations • LREC 2022 • Pawel Kamocki, Andreas Witt
Ethical issues in Language Resources and Language Technology are often invoked, but rarely discussed.