no code implementations • Findings (EMNLP) 2021 • Nattapol Trijakwanich, Peerat Limkonchotiwat, Raheem Sarwar, Wannaphong Phatthiyaphaibun, Ekapol Chuangsuwanich, Sarana Nutanong
Cross-lingual Sentence Retrieval (CLSR) aims at retrieving parallel sentence pairs that are translations of each other from a multilingual set of comparable documents.
1 code implementation • EMNLP 2020 • Peerat Limkonchotiwat, Wannaphong Phatthiyaphaibun, Raheem Sarwar, Ekapol Chuangsuwanich, Sarana Nutanong
Like many Natural Language Processing tasks, Thai word segmentation is domain-dependent.
Ranked #1 on
Thai Word Segmentation
on WS160
(using extra training data)
1 code implementation • 24 Mar 2024 • Wannaphong Phatthiyaphaibun, Surapon Nonesung, Patomporn Payoungkhamdee, Peerat Limkonchotiwat, Can Udomcharoenchaikit, Jitkapat Sawatphol, Chompakorn Chaksangchaichot, Ekapol Chuangsuwanich, Sarana Nutanong
Our model is based on SEA-LION and a collection of instruction following datasets.
1 code implementation • 7 Dec 2023 • Wannaphong Phatthiyaphaibun, Korakot Chaovavanich, Charin Polpanumas, Arthit Suriyawongkul, Lalita Lowphansirikul, Pattarawat Chormai, Peerat Limkonchotiwat, Thanathip Suntorntip, Can Udomcharoenchaikit
It provides a wide range of software, models, and datasets for Thai language.
1 code implementation • 9 Aug 2022 • Wannaphong Phatthiyaphaibun, Chompakorn Chaksangchaichot, Peerat Limkonchotiwat, Ekapol Chuangsuwanich, Sarana Nutanong
However, most of these ASR models are available in English; only a minority of the models are available in Thai.
Automatic Speech Recognition
Automatic Speech Recognition (ASR)
+2
1 code implementation • Zenodo 2022 • Wannaphong Phatthiyaphaibun
LaoNLP, the Lao language Natural Language Processing Toolkit, is an open-source tool for doing natural language processing pipelines with the Lao language in Python programming language.