Search Results for author: Krister Lind{\'e}n

Found 25 papers, 2 papers with code

Building Web Corpora for Minority Languages

no code implementations LREC 2020 Heidi Jauhiainen, Tommi Jauhiainen, Krister Lind{\'e}n

Web corpora creation for minority languages that do not have their own top-level Internet domain is no trivial matter.

Language Identification

Discriminating between Mandarin Chinese and Swiss-German varieties using adaptive language models

no code implementations WS 2019 Tommi Jauhiainen, Krister Lind{\'e}n, Heidi Jauhiainen

This paper describes the language identification systems used by the SUKI team in the Discriminating between the Mainland and Taiwan variation of Mandarin Chinese (DMT) and the German Dialect Identification (GDI) shared tasks which were held as part of the third VarDial Evaluation Campaign.

Dialect Identification Language Modelling

Iterative Language Model Adaptation for Indo-Aryan Language Identification

no code implementations COLING 2018 Tommi Jauhiainen, Heidi Jauhiainen, Krister Lind{\'e}n

This paper presents the experiments and results obtained by the SUKI team in the Indo-Aryan Language Identification shared task of the VarDial 2018 Evaluation Campaign.

Language Identification Language Modelling

HeLI-based Experiments in Discriminating Between Dutch and Flemish Subtitles

no code implementations COLING 2018 Tommi Jauhiainen, Heidi Jauhiainen, Krister Lind{\'e}n

This paper presents the experiments and results obtained by the SUKI team in the Discriminating between Dutch and Flemish in Subtitles shared task of the VarDial 2018 Evaluation Campaign.

Language Identification Text Categorization

HeLI-based Experiments in Swiss German Dialect Identification

no code implementations COLING 2018 Tommi Jauhiainen, Heidi Jauhiainen, Krister Lind{\'e}n

In this paper we present the experiments and results by the SUKI team in the German Dialect Identification shared task of the VarDial 2018 Evaluation Campaign.

Dialect Identification

Evaluating HeLI with Non-Linear Mappings

no code implementations WS 2017 Tommi Jauhiainen, Krister Lind{\'e}n, Heidi Jauhiainen

In this paper we describe the non-linear mappings we used with the Helsinki language identification method, HeLI, in the 4th edition of the Discriminating between Similar Languages (DSL) shared task, which was organized as part of the VarDial 2017 workshop.

Language Identification

CLARA: A New Generation of Researchers in Common Language Resources and Their Applications

no code implementations LREC 2014 Koenraad De Smedt, Erhard Hinrichs, Detmar Meurers, Inguna Skadi{\c{n}}a, Bolette Pedersen, Costanza Navarretta, N{\'u}ria Bel, Krister Lind{\'e}n, Mark{\'e}ta Lopatkov{\'a}, Jan Haji{\v{c}}, Gisle Andersen, Przemyslaw Lenkiewicz

CLARA (Common Language Resources and Their Applications) is a Marie Curie Initial Training Network which ran from 2009 until 2014 with the aim of providing researcher training in crucial areas related to language resources and infrastructure.

HFST-SweNER --- A New NER Resource for Swedish

no code implementations LREC 2014 Dimitrios Kokkinakis, Jyrki Niemi, Sam Hardwick, Krister Lind{\'e}n, Lars Borin

Named entity recognition (NER) is a knowledge-intensive information extraction task that is used for recognizing textual mentions of entities that belong to a predefined set of categories, such as locations, organizations and time expressions.

Machine Translation Named Entity Recognition +4

Heuristic Hyper-minimization of Finite State Lexicons

no code implementations LREC 2014 Senka Drobac, Krister Lind{\'e}n, Tommi Pirinen, Miikka Silfverberg

The most noticeable reduction in size we got with a morphological transducer for Greenlandic, whose original size is on average about 15 times larger than other morphologies.

Representing the Translation Relation in a Bilingual Wordnet

no code implementations LREC 2012 Jyrki Niemi, Krister Lind{\'e}n

FiWN was created by translating all the word senses of the Princeton WordNet (PWN) into Finnish and by joining the translations with the semantic and lexical relations of PWN extracted into a relational (database) format.

Translation

Specifying Treebanks, Outsourcing Parsebanks: FinnTreeBank 3

no code implementations LREC 2012 Atro Voutilainen, Kristiina Muhonen, Tanja Purtonen, Krister Lind{\'e}n

We argue for use of large descriptive grammars and their sample sentences as a basis for specifying higher-coverage grammatical representations.

Cannot find the paper you are looking for? You can Submit a new open access paper.