Search Results for author: Yoonseob Lim

Found 1 papers, 1 papers with code

Improving Korean NLP Tasks with Linguistically Informed Subword Tokenization and Sub-character Decomposition

1 code implementation7 Nov 2023 Taehee Jeon, BongSeok Yang, ChangHwan Kim, Yoonseob Lim

We introduce a morpheme-aware subword tokenization method that utilizes sub-character decomposition to address the challenges of applying Byte Pair Encoding (BPE) to Korean, a language characterized by its rich morphology and unique writing system.

CoLA Computational Efficiency +1

Cannot find the paper you are looking for? You can Submit a new open access paper.