Search Results for author: Nils Diewald

Found 4 papers, 0 papers with code

Matrix and Double-Array Representations for Efficient Finite State Tokenization

no code implementations CMLC (LREC) 2022 Nils Diewald

This paper presents an algorithm and implementation for efficient tokenization of space-delimited languages based on a deterministic finite state automaton.

RKorAPClient: An R Package for Accessing the German Reference Corpus DeReKo via KorAP

no code implementations LREC 2020 Marc Kupietz, Nils Diewald, Eliza Margaretha

Making corpora accessible and usable for linguistic research is a huge challenge in view of (too) big data, legal issues and a rapidly evolving methodology.

Access control by query rewriting: the case of KorAP

no code implementations LREC 2014 Piotr Ba{\'n}ski, Nils Diewald, Michael Hanl, Marc Kupietz, Andreas Witt

We present an approach to an aspect of managing complex access scenarios to large and heterogeneous corpora that involves handling user queries that, intentionally or due to the complexity of the queried resource, target texts or annotations outside of the given userÂ’s permissions.

Cannot find the paper you are looking for? You can Submit a new open access paper.