Task	Dataset	Model	Metric Name	Metric Value	Global Rank
Chinese Word Segmentation	AS	WMSeg + ZEN	F1	96.62	#2
Chinese Word Segmentation	CITYU	WMSeg + ZEN	F1	97.93	#1
Chinese Word Segmentation	CTB6	WMSeg + ZEN	F1	97.25	#4
Chinese Word Segmentation	MSR	WMSeg + ZEN	F1	98.40	#3
Chinese Word Segmentation	PKU	WMSeg + ZEN	F1	96.53	#4
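
For readers unfamiliar with the metric, the following is a minimal sketch of how word-level F1 is commonly computed for word segmentation: a predicted word counts as correct only if both of its boundaries match a gold word. The function names `to_spans` and `corpus_f1` are illustrative and are not taken from the paper or its repository, and the official scoring used for the numbers above may differ in details.

```python
def to_spans(words):
    """Convert a list of words into a set of (start, end) character spans."""
    spans = set()
    start = 0
    for word in words:
        end = start + len(word)
        spans.add((start, end))
        start = end
    return spans


def corpus_f1(pairs):
    """Word-level F1 over (gold_words, predicted_words) sentence pairs.

    Counts are summed over all sentences before computing precision and
    recall (an assumption about the aggregation, not the paper's script).
    """
    correct = n_gold = n_pred = 0
    for gold, pred in pairs:
        gold_spans = to_spans(gold)
        pred_spans = to_spans(pred)
        correct += len(gold_spans & pred_spans)
        n_gold += len(gold_spans)
        n_pred += len(pred_spans)
    if correct == 0:
        return 0.0
    precision = correct / n_pred
    recall = correct / n_gold
    return 2 * precision * recall / (precision + recall)


# Hypothetical example: the predictor wrongly splits the second word.
gold = ["我", "喜欢", "北京"]
pred = ["我", "喜", "欢", "北京"]
score = corpus_f1([(gold, pred)])  # precision 2/4, recall 2/3
```

Multiplying the result by 100 gives a percentage on the same scale as the table.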

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-chinese-word-segmentation-with/chinese-word-segmentation-on-cityu)](https://paperswithcode.com/sota/chinese-word-segmentation-on-cityu?p=improving-chinese-word-segmentation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-chinese-word-segmentation-with/chinese-word-segmentation-on-as)](https://paperswithcode.com/sota/chinese-word-segmentation-on-as?p=improving-chinese-word-segmentation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-chinese-word-segmentation-with/chinese-word-segmentation-on-msr)](https://paperswithcode.com/sota/chinese-word-segmentation-on-msr?p=improving-chinese-word-segmentation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-chinese-word-segmentation-with/chinese-word-segmentation-on-ctb6)](https://paperswithcode.com/sota/chinese-word-segmentation-on-ctb6?p=improving-chinese-word-segmentation-with)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/improving-chinese-word-segmentation-with/chinese-word-segmentation-on-pku)](https://paperswithcode.com/sota/chinese-word-segmentation-on-pku?p=improving-chinese-word-segmentation-with)`

Improving Chinese Word Segmentation with Wordhood Memory Networks

ACL 2020 · Yuanhe Tian, Yan Song, Fei Xia, Tong Zhang, Yonggang Wang

Contextual features always play an important role in Chinese word segmentation (CWS). Wordhood information, as one such contextual feature, has proved useful in many conventional character-based segmenters. However, this feature has received less attention in recent neural models, and it is also challenging to design a framework that can properly integrate wordhood information from different wordhood measures into existing neural frameworks. In this paper, we therefore propose a neural framework, WMSeg, which uses memory networks to incorporate wordhood information with several popular encoder-decoder combinations for CWS. Experimental results on five benchmark datasets indicate that the memory mechanism successfully models wordhood information for neural segmenters and helps WMSeg achieve state-of-the-art performance on all of those datasets. Further experiments and analyses also demonstrate the robustness of our proposed framework with respect to different wordhood measures and the efficiency of wordhood information in cross-domain experiments.

Code

SVAIGBA/WMSeg official

173

Tasks

Chinese Word Segmentation

Results from the Paper

Ranked #1 on Chinese Word Segmentation on CITYU
