1 code implementation • LREC 2020 • Suteera Seeha, Ivan Bilan, Liliana Mamani Sanchez, Johannes Huber, Michael Matuschek, Hinrich Sch{\"u}tze
We propose ThaiLMCut, a semi-supervised approach for Thai word segmentation which utilizes a bi-directional character language model (LM) as a way to leverage useful linguistic knowledge from unlabeled data.
Ranked #3 on Thai Word Segmentation on BEST-2010
no code implementations • TACL 2013 • Michael Matuschek, Iryna Gurevych
In this paper, we present Dijkstra-WSA, a novel graph-based algorithm for word sense alignment.
no code implementations • LREC 2012 • Judith Eckle-Kohler, Iryna Gurevych, Silvana Hartmann, Michael Matuschek, Christian M. Meyer
We present UBY-LMF, an LMF-based model for large-scale, heterogeneous multilingual lexical-semantic resources (LSRs).
no code implementations • LREC 2012 • Christian Chiarcos, Sebastian Hellmann, Sebastian Nordhoff, Steven Moran, Richard Littauer, Judith Eckle-Kohler, Iryna Gurevych, Silvana Hartmann, Michael Matuschek, Christian M. Meyer
This paper describes the Open Linguistics Working Group (OWLG) of the Open Knowledge Foundation (OKFN).