1 code implementation • 21 Nov 2023 • Panyut Sriwirote, Jalinee Thapiang, Vasan Timtong, Attapol T. Rutherford
While WangchanBERTa has become the de facto standard in transformer-based Thai language modeling, it still struggles to understand foreign words, most notably English words, which in many contexts are borrowed into Thai without orthographic assimilation.