1 code implementation • 7 Dec 2023 • Wannaphong Phatthiyaphaibun, Korakot Chaovavanich, Charin Polpanumas, Arthit Suriyawongkul, Lalita Lowphansirikul, Pattarawat Chormai, Peerat Limkonchotiwat, Thanathip Suntorntip, Can Udomcharoenchaikit
It provides a wide range of software, models, and datasets for Thai language.
1 code implementation • 6 Nov 2023 • Peerat Limkonchotiwat, Wuttikorn Ponwitayarat, Lalita Lowphansirikul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong
In this paper, we propose a framework called Self-supervised Cross-View Training (SCT) to narrow the performance gap between large and small PLMs.
2 code implementations • 24 Jan 2021 • Lalita Lowphansirikul, Charin Polpanumas, Nawat Jantrakulchai, Sarana Nutanong
However, for a relatively low-resource language such as Thai, the choices of models are limited to training a BERT-based model based on a much smaller dataset or finetuning multi-lingual models, both of which yield suboptimal downstream performance.
no code implementations • 7 Jul 2020 • Lalita Lowphansirikul, Charin Polpanumas, Attapol T. Rutherford, Sarana Nutanong
The primary objective of our work is to build a large-scale English-Thai dataset for machine translation.