Search Results for author: Armin Hoenen

Found 11 papers, 2 papers with code

Can the Language of the Collation be Translated into the Language of the Stemma? Using Machine Translation for Witness Localization

no code implementations 11 Jun 2022 Armin Hoenen

Stemmatology is a subfield of philology where one approach to understand the copy-history of textual variants of a text (witnesses of a tradition) is to generate an evolutionary tree.

Machine Translation

Two LRL & Distractor Corpora from Web Information Retrieval and a Small Case Study in Language Identification without Training Corpora

no code implementations LREC 2020 Armin Hoenen, Cemre Koc, Marc Rahn

In recent years, low resource languages (LRLs) have seen a surge in interest after certain tasks have been solved for larger ones and as they present various challenges (data sparsity, sparsity of experts and expertise, unusual structural properties etc.).

Information Retrieval, Language Identification +1

Wikipedia Titles As Noun Tag Predictors

no code implementations LREC 2016 Armin Hoenen

In this paper, we investigate a covert labeling cue, namely the probability that a title (by example of the Wikipedia titles) is a noun.

Machine Translation, Part-Of-Speech Tagging +4

TGermaCorp -- A (Digital) Humanities Resource for (Computational) Linguistics

no code implementations LREC 2016 Andy Luecking, Armin Hoenen, Alex Mehler, er

In order to introduce TGermaCorp in comparison to more homogeneous corpora of contemporary everyday language, quantitative assessments of syntactic and lexical diversity are provided.

LEMMA, POS
