no code implementations • Findings (ACL) 2022 • Jin Cheevaprawatdomrong, Alexandra Schofield, Attapol Rutherford
Traditionally, Latent Dirichlet Allocation (LDA) ingests words in a collection of documents to discover their latent topics using word-document co-occurrences.
no code implementations • 24 Aug 2021 • Jin Cheevaprawatdomrong, Alexandra Schofield, Attapol T. Rutherford
Traditionally, Latent Dirichlet Allocation (LDA) ingests words in a collection of documents to discover their latent topics using word-document co-occurrences.
no code implementations • COLING 2020 • Pattarawat Chormai, Ponrawee Prasertsom, Jin Cheevaprawatdomrong, Attapol Rutherford
Word segmentation is a challenging pre-processing step for Thai natural language processing because Thai text lacks explicit word boundaries. Previous systems rely on powerful neural network architectures alone and ignore the linguistic substructures of Thai words.