Search Results for author: Sebastian Nordhoff

Found 7 papers, 0 papers with code

Modelling and Annotating Interlinear Glossed Text from 280 Different Endangered Languages as Linked Data with LIGT

no code implementations COLING (LAW) 2020 Sebastian Nordhoff

This paper reports on the harvesting, analysis, and enrichment of 20k documents from 4 different endangered language archives in 300 different low-resource languages.

From the attic to the cloud: mobilization of endangered language resources with linked data

no code implementations LREC 2020 Sebastian Nordhoff

This paper describes a collection of 20k ELAN annotation files harvested from five different endangered language archives.


Extracting Interlinear Glossed Text from LaTeX Documents

no code implementations LREC 2016 Mathias Schenner, Sebastian Nordhoff

We present texigt, a command-line tool for the extraction of structured linguistic data from LaTeX source documents, and a language resource that has been generated using this tool: a corpus of interlinear glossed text (IGT) extracted from open access books published by Language Science Press.

The Alaskan Athabascan Grammar Database

no code implementations LREC 2016 Sebastian Nordhoff, Siri Tuttle, Olga Lovick

This paper describes a repository of example sentences in three endangered Athabascan languages: Koyukon, Upper Tanana, Lower Tanana.

Glottolog/Langdoc:Increasing the visibility of grey literature for low-density languages

no code implementations LREC 2012 Sebastian Nordhoff, Harald Hammarstr{\"o}m

Language resources can be divided into structural resources treating phonology, morphosyntax, semantics etc.

Cannot find the paper you are looking for? You can Submit a new open access paper.