Chinese Word Segmentation
48 papers with code • 6 benchmarks • 3 datasets
Chinese word segmentation is the task of splitting Chinese text (i.e. a sequence of Chinese characters) into words (Source: www.nlpprogress.com).
Benchmarks
These leaderboards are used to track progress in Chinese Word Segmentation
Most implemented papers
Exploring Segment Representations for Neural Segmentation Models
Many natural language processing (NLP) tasks can be generalized into segmentation problem.
Neural Word Segmentation Learning for Chinese
Most previous approaches to Chinese word segmentation formalize this problem as a character-based sequence labeling task where only contextual information within fixed sized local windows and simple interactions between adjacent tags can be captured.
Fast and Accurate Neural Word Segmentation for Chinese
Neural models with minimal feature engineering have achieved competitive performance against traditional methods for the task of Chinese word segmentation.
Convolutional Neural Network with Word Embeddings for Chinese Word Segmentation
The first is that they heavily rely on manually designed bigram feature, i. e. they are not good at capturing n-gram features automatically.
Effective Neural Solution for Multi-Criteria Word Segmentation
We present a simple yet elegant solution to train a single joint model on multi-criteria corpora for Chinese Word Segmentation (CWS).
Dual Long Short-Term Memory Networks for Sub-Character Representation Learning
To build a concrete study and substantiate the efficiency of our neural architecture, we take Chinese Word Segmentation as a research case example.
Adaptive Multi-Task Transfer Learning for Chinese Word Segmentation in Medical Text
Chinese word segmentation (CWS) trained from open source corpus faces dramatic performance drop when dealing with domain text, especially for a domain with lots of special terms and diverse writing styles, such as the biomedical domain.
State-of-the-art Chinese Word Segmentation with Bi-LSTMs
A wide variety of neural-network architectures have been proposed for the task of Chinese word segmentation.