no code implementations • 26 Jun 2019 • Kai Hakala, Aleksi Vesanto, Niko Miekka, Tapio Salakoski, Filip Ginter
A common approach for improving OCR quality is a post-processing step based on models correcting misdetected characters and tokens.
no code implementations • CONLL 2018 • Jenna Kanerva, Filip Ginter, Niko Miekka, Akseli Leino, Tapio Salakoski
In this paper we describe the TurkuNLP entry at the CoNLL 2018 Shared Task on Multilingual Parsing from Raw Text to Universal Dependencies.
Ranked #5 on Dependency Parsing on Universal Dependencies