TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Morpheme Segmentaiton	UniMorph 4.0	CLUZH-3	f1 macro avg (subtask 2)	88.14	# 3
Morpheme Segmentaiton	UniMorph 4.0	CLUZH-3	lev dist (subtask 2)	5.58	# 4
Morpheme Segmentaiton	UniMorph 4.0	CLUZH-2	f1 macro avg (subtask 2)	87.93	# 4
Morpheme Segmentaiton	UniMorph 4.0	CLUZH-2	lev dist (subtask 2)	5.62	# 5
Morpheme Segmentaiton	UniMorph 4.0	CLUZH-1	f1 macro avg (subtask 2)	87.68	# 5
Morpheme Segmentaiton	UniMorph 4.0	CLUZH-1	lev dist (subtask 2)	5.69	# 6
Morpheme Segmentaiton	UniMorph 4.0	Ensemble of hard-attention transducers (CLUZH)	macro avg (subtask 1)	96.85	# 3

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/cluzh-at-sigmorphon-2022-shared-tasks-on/morpheme-segmentaiton-on-unimorph-4-0)](https://paperswithcode.com/sota/morpheme-segmentaiton-on-unimorph-4-0?p=cluzh-at-sigmorphon-2022-shared-tasks-on)`

CLUZH at SIGMORPHON 2022 Shared Tasks on Morpheme Segmentation and Inflection Generation

NAACL (SIGMORPHON) 2022 · Silvan Wehrli, Simon Clematide, Peter Makarov ·

This paper describes the submissions of the team of the Department of Computational Linguistics, University of Zurich, to the SIGMORPHON 2022 Shared Tasks on Morpheme Segmentation and Inflection Generation. Our submissions use a character-level neural transducer that operates over traditional edit actions. While this model has been found particularly wellsuited for low-resource settings, using it with large data quantities has been difficult. Existing implementations could not fully profit from GPU acceleration and did not efficiently implement mini-batch training, which could be tricky for a transition-based system. For this year’s submission, we have ported the neural transducer to PyTorch and implemented true mini-batch training. This has allowed us to successfully scale the approach to large data quantities and conduct extensive experimentation. We report competitive results for morpheme segmentation (including sharing first place in part 2 of the challenge). We also demonstrate that reducing sentence-level morpheme segmentation to a word-level problem is a simple yet effective strategy. Additionally, we report strong results in inflection generation (the overall best result for large training sets in part 1, the best results in low-resource learning trajectories in part 2). Our code is publicly available.

PDF Abstract

Code

Add Remove Mark official

slvnwhrl/il-reimplementation official

Tasks

Add Remove

Morpheme Segmentaiton

Morphological Inflection

Segmentation

Sentence

Datasets

UniMorph 4.0

Results from the Paper

Add Remove

Ranked #3 on Morpheme Segmentaiton on UniMorph 4.0 (f1 macro avg (subtask 2) metric)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Morpheme Segmentaiton	UniMorph 4.0	CLUZH-3	f1 macro avg (subtask 2)	88.14	# 3	Compare
Morpheme Segmentaiton	UniMorph 4.0	CLUZH-3	lev dist (subtask 2)	5.58	# 4	Compare
Morpheme Segmentaiton	UniMorph 4.0	CLUZH-2	f1 macro avg (subtask 2)	87.93	# 4	Compare
Morpheme Segmentaiton	UniMorph 4.0	CLUZH-2	lev dist (subtask 2)	5.62	# 5	Compare
Morpheme Segmentaiton	UniMorph 4.0	CLUZH-1	f1 macro avg (subtask 2)	87.68	# 5	Compare
Morpheme Segmentaiton	UniMorph 4.0	CLUZH-1	lev dist (subtask 2)	5.69	# 6	Compare
Morpheme Segmentaiton	UniMorph 4.0	Ensemble of hard-attention transducers (CLUZH)	macro avg (subtask 1)	96.85	# 3	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

CLUZH at SIGMORPHON 2022 Shared Tasks on Morpheme Segmentation and Inflection Generation

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit Add Remove

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Add Remove

Methods

Add Remove