1 code implementation • LREC 2020 • Suteera Seeha, Ivan Bilan, Liliana Mamani Sanchez, Johannes Huber, Michael Matuschek, Hinrich Sch{\"u}tze
We propose ThaiLMCut, a semi-supervised approach for Thai word segmentation which utilizes a bi-directional character language model (LM) as a way to leverage useful linguistic knowledge from unlabeled data.
Ranked #3 on Thai Word Segmentation on BEST-2010
no code implementations • WS 2016 • Hector-Hugo Franco-Penya, Liliana Mamani Sanchez
This paper describes an analysis of our submissions to the Dialect Detection Shared Task 2016.