Search Results for author: Michael Matuschek

Found 6 papers, 1 papers with code

ThaiLMCut: Unsupervised Pretraining for Thai Word Segmentation

1 code implementation LREC 2020 Suteera Seeha, Ivan Bilan, Liliana Mamani Sanchez, Johannes Huber, Michael Matuschek, Hinrich Sch{\"u}tze

We propose ThaiLMCut, a semi-supervised approach for Thai word segmentation which utilizes a bi-directional character language model (LM) as a way to leverage useful linguistic knowledge from unlabeled data.

Language Modelling Segmentation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.