no code implementations • LREC 2020 • Katrin Ortmann, Stefanie Dipper
Independently of the medial representation (written/spoken), language can exhibit characteristics of conceptual orality or literacy, which mainly manifest themselves on the lexical or syntactic level.
no code implementations • WS 2019 • Ronja Laarmann-Quante, Stefanie Dipper, Eva Belke
To date, corpus and computational linguistic work on written language acquisition has mostly dealt with second language learners who have usually already mastered orthography acquisition in their first language.
no code implementations • WS 2019 • Katrin Ortmann, Stefanie Dipper
This paper deals with the automatic identification of literate and oral discourse in German texts.
no code implementations • 14 Sep 2018 • Uwe Springmann, Christian Reul, Stefanie Dipper, Johannes Baiter
In this paper we describe a dataset of German and Latin \textit{ground truth} (GT) for historical OCR in the form of printed text line images paired with their transcription.
no code implementations • CL 2018 • Varada Kolhatkar, Adam Roussel, Stefanie Dipper, Heike Zinsmeister
Most of the existing approaches to anaphora annotation and resolution focus on nominal-antecedent anaphora, classifying many of the cases where the antecedents are syntactically non-nominal as non-anaphoric.
no code implementations • WS 2017 • Ronja Laarmann-Quante, Katrin Ortmann, Anna Ehlert, Maurice Vogel, Stefanie Dipper
NLP applications for learners often rely on annotated learner corpora.
no code implementations • WS 2017 • Stefanie Dipper, S Waldenberger, ra
This paper investigates diatopic variation in a historical corpus of German.
no code implementations • LREC 2012 • Stefanie Dipper, Melanie Seiss, Heike Zinsmeister
Motivated by the need to use a parallel resource for cross-linguistic feature induction in abstract anaphora resolution, this paper investigates properties of English and German texts in the Europarl corpus, taking into account both general features such as sentence length as well as task-dependent features such as the distribution of demonstrative noun phrases.