Search Results for author: Armin Hoenen

Found 11 papers, 2 papers with code

Can the Language of the Collation be Translated into the Language of the Stemma? Using Machine Translation for Witness Localization

no code implementations • 11 Jun 2022 • Armin Hoenen

Stemmatology is a subfield of philology where one approach to understand the copy-history of textual variants of a text (witnesses of a tradition) is to generate an evolutionary tree.

Machine Translation

Paper
Add Code

Two LRL \& Distractor Corpora from Web Information Retrieval and a Small Case Study in Language Identification without Training Corpora

no code implementations • LREC 2020 • Armin Hoenen, Cemre Koc, Marc Rahn

In recent years, low resource languages (LRLs) have seen a surge in interest after certain tasks have been solved for larger ones and as they present various challenges (data sparsity, sparsity of experts and expertise, unusual structural properties etc.).

Information Retrieval Language Identification +1

Paper
Add Code

Multi Modal Distance - An Approach to Stemma Generation With Weighting

1 code implementation • LREC 2018 • Armin Hoenen

Paper
Code

From Manuscripts to Archetypes through Iterative Clustering

no code implementations • LREC 2018 • Armin Hoenen

Clustering

Paper
Add Code

Knowing the Author by the Company His Words Keep

no code implementations • LREC 2018 • Armin Hoenen, Niko Schenk

Semantic Textual Similarity Word Embeddings

Paper
Add Code

How Many Stemmata with Root Degree k?

1 code implementation • WS 2017 • Armin Hoenen, Steffen Eger, Ralf Gehrke

Paper
Code

Language classification from bilingual word embedding graphs

no code implementations • COLING 2016 • Steffen Eger, Armin Hoenen, Alexander Mehler

We study the role of the second language in bilingual word embeddings in monolingual semantic evaluation tasks.

Classification General Classification +1

Paper
Add Code

Wikipedia Titles As Noun Tag Predictors

no code implementations • LREC 2016 • Armin Hoenen

In this paper, we investigate a covert labeling cue, namely the probability that a title (by example of the Wikipedia titles) is a noun.

Machine Translation Part-Of-Speech Tagging +4

Paper
Add Code

TGermaCorp -- A (Digital) Humanities Resource for (Computational) Linguistics

no code implementations • LREC 2016 • Andy Luecking, Armin Hoenen, Alex Mehler, er

In order to introduce TGermaCorp in comparison to more homogeneous corpora of contemporary everyday language, quantitative assessments of syntactic and lexical diversity are provided.

LEMMA POS

Paper
Add Code

Lachmannian Archetype Reconstruction for Ancient Manuscript Corpora

no code implementations • HLT 2015 • Armin Hoenen

Paper
Add Code

Source and Translation Classification using Most Frequent Words

no code implementations • IJCNLP 2013 • Zahurul Islam, Armin Hoenen

Classification General Classification +3

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.