1 code implementation • RANLP 2021 • Thodsaporn Chay-intr, Hidetaka Kamigaito, Manabu Okumura
These models estimate word boundaries from a character sequence.
Ranked #2 on Thai Word Segmentation on BEST-2010
2 code implementations • Journal of Natural Language Processing 2023 • Thodsaporn Chay-intr, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura
Our model employs the lattice structure to handle segmentation alternatives and utilizes graph neural networks along with an attention mechanism to attentively extract multi-granularity representation from the lattice for complementing character representations.
Ranked #1 on Chinese Word Segmentation on CTB6 (using extra training data)