TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Chunking	CoNLL 2000	BiLSTM-CRF	Exact Span F1	95.18	# 9
Chunking	CoNLL 2000	Adversarial Training	Exact Span F1	95.25	# 8
Named Entity Recognition (NER)	CoNLL 2003 (English)	Adversarial Bi-LSTM	F1	91.56	# 59
Part-Of-Speech Tagging	Penn Treebank	Adversarial Bi-LSTM	Accuracy	97.59	# 8
Part-Of-Speech Tagging	UD	Adversarial Bi-LSTM	Avg accuracy	96.65	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/robust-multilingual-part-of-speech-tagging/part-of-speech-tagging-on-ud)](https://paperswithcode.com/sota/part-of-speech-tagging-on-ud?p=robust-multilingual-part-of-speech-tagging)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/robust-multilingual-part-of-speech-tagging/chunking-on-conll-2000)](https://paperswithcode.com/sota/chunking-on-conll-2000?p=robust-multilingual-part-of-speech-tagging)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/robust-multilingual-part-of-speech-tagging/part-of-speech-tagging-on-penn-treebank)](https://paperswithcode.com/sota/part-of-speech-tagging-on-penn-treebank?p=robust-multilingual-part-of-speech-tagging)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/robust-multilingual-part-of-speech-tagging/named-entity-recognition-ner-on-conll-2003)](https://paperswithcode.com/sota/named-entity-recognition-ner-on-conll-2003?p=robust-multilingual-part-of-speech-tagging)`

Robust Multilingual Part-of-Speech Tagging via Adversarial Training

NAACL 2018 · Michihiro Yasunaga, Jungo Kasai, Dragomir Radev ·

Adversarial training (AT) is a powerful regularization method for neural networks, aiming to achieve robustness to input perturbations. Yet, the specific effects of the robustness obtained from AT are still unclear in the context of natural language processing. In this paper, we propose and analyze a neural POS tagging model that exploits AT. In our experiments on the Penn Treebank WSJ corpus and the Universal Dependencies (UD) dataset (27 languages), we find that AT not only improves the overall tagging accuracy, but also 1) prevents over-fitting well in low resource languages and 2) boosts tagging accuracy for rare / unseen words. We also demonstrate that 3) the improved tagging performance by AT contributes to the downstream task of dependency parsing, and that 4) AT helps the model to learn cleaner word representations. 5) The proposed AT model is generally effective in different sequence labeling tasks. These positive results motivate further use of AT for natural language tasks.

PDF Abstract NAACL 2018 PDF NAACL 2018 Abstract

Code

Add Remove Mark official

michiyasunaga/pos_adv official

Tasks

Add Remove

Chunking

Dependency Parsing

Named Entity Recognition (NER)

Part-Of-Speech Tagging

POS

POS Tagging

Datasets

Penn Treebank CoNLL 2003

Universal Dependencies CoNLL CoNLL-2000

Results from the Paper

Edit

Ranked #2 on Part-Of-Speech Tagging on UD

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Chunking	CoNLL 2000	BiLSTM-CRF	Exact Span F1	95.18	# 9	Compare
Chunking	CoNLL 2000	Adversarial Training	Exact Span F1	95.25	# 8	Compare
Named Entity Recognition (NER)	CoNLL 2003 (English)	Adversarial Bi-LSTM	F1	91.56	# 59	Compare
Part-Of-Speech Tagging	Penn Treebank	Adversarial Bi-LSTM	Accuracy	97.59	# 8	Compare
Part-Of-Speech Tagging	UD	Adversarial Bi-LSTM	Avg accuracy	96.65	# 2	Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Robust Multilingual Part-of-Speech Tagging via Adversarial Training

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove