1 code implementation • 21 Nov 2023 • Panyut Sriwirote, Jalinee Thapiang, Vasan Timtong, Attapol T. Rutherford
While WangchanBERTa has become the de facto standard in transformer-based Thai language modeling, it still struggles to understand foreign words, most notably English words, which in many contexts are borrowed into Thai without orthographic assimilation.
no code implementations • 24 Aug 2021 • Jin Cheevaprawatdomrong, Alexandra Schofield, Attapol T. Rutherford
Traditionally, Latent Dirichlet Allocation (LDA) ingests words in a collection of documents to discover their latent topics using word-document co-occurrences.
no code implementations • 7 Jul 2020 • Lalita Lowphansirikul, Charin Polpanumas, Attapol T. Rutherford, Sarana Nutanong
The primary objective of our work is to build a large-scale English-Thai dataset for machine translation.
no code implementations • 7 Jun 2016 • Attapol T. Rutherford, Vera Demberg, Nianwen Xue
Inferring implicit discourse relations in natural language text is the most difficult subtask in discourse parsing.