Search Results for author: Jin Cheevaprawatdomrong

Found 3 papers, 0 papers with code

More Than Words: Collocation Retokenization for Latent Dirichlet Allocation Models

no code implementations Findings (ACL) 2022 Jin Cheevaprawatdomrong, Alexandra Schofield, Attapol Rutherford

Traditionally, Latent Dirichlet Allocation (LDA) ingests words in a collection of documents to discover their latent topics using word-document co-occurrences.

More Than Words: Collocation Tokenization for Latent Dirichlet Allocation Models

no code implementations24 Aug 2021 Jin Cheevaprawatdomrong, Alexandra Schofield, Attapol T. Rutherford

Traditionally, Latent Dirichlet Allocation (LDA) ingests words in a collection of documents to discover their latent topics using word-document co-occurrences.

Syllable-based Neural Thai Word Segmentation

no code implementations COLING 2020 Pattarawat Chormai, Ponrawee Prasertsom, Jin Cheevaprawatdomrong, Attapol Rutherford

Word segmentation is a challenging pre-processing step for Thai Natural Language Processing due to the lack of explicit word boundaries. The previous systems rely on powerful neural network architecture alone and ignore linguistic substructures of Thai words.

Segmentation Thai Word Segmentation

Cannot find the paper you are looking for? You can Submit a new open access paper.