Vietnamese Word Segmentation

6 papers with code • 0 benchmarks • 0 datasets

This task has no description! Would you like to contribute one?

Most implemented papers

A hybrid approach to Vietnamese word segmentation

phongnt570/UETsegmenter 29 Dec 2016

Word segmentation is the very first task for Vietnamese language processing.

Vietnamese Word Segmentation with SVM: Ambiguity Reduction and Suffix Capture

ngannlt/UITws-v1 14 Jun 2020

In this paper, we approach Vietnamese word segmentation as a binary classification by using the Support Vector Machine classifier.

A Pilot Study of Text-to-SQL Semantic Parsing for Vietnamese

VinAIResearch/ViText2SQL Findings of the Association for Computational Linguistics 2020

We compare the two baselines with key configurations and find that: automatic Vietnamese word segmentation improves the parsing results of both baselines; the normalized pointwise mutual information (NPMI) score (Bouma, 2009) is useful for schema linking; latent syntactic features extracted from a neural dependency parser for Vietnamese also improve the results; and the monolingual language model PhoBERT for Vietnamese (Nguyen and Nguyen, 2020) helps produce higher performances than the recent best multilingual language model XLM-R (Conneau et al., 2020).

COVID-19 Named Entity Recognition for Vietnamese

VinAIResearch/PhoNER_COVID19 NAACL 2021

The current COVID-19 pandemic has lead to the creation of many corpora that facilitate NLP research and downstream applications to help fight the pandemic.