Beto, Bentz, Becas: The Surprising Cross-Lingual Effectiveness of BERT

IJCNLP 2019 · Shijie Wu, Mark Dredze

Pretrained contextual representation models (Peters et al., 2018; Devlin et al., 2018) have pushed forward the state-of-the-art on many NLP tasks. A new release of BERT (Devlin, 2018) includes a model simultaneously pretrained on 104 languages with impressive performance for zero-shot cross-lingual transfer on a natural language inference task. This paper explores the broader cross-lingual potential of mBERT (multilingual BERT) as a zero-shot language transfer model on 5 NLP tasks covering a total of 39 languages from various language families: NLI, document classification, NER, POS tagging, and dependency parsing. We compare mBERT with the best published methods for zero-shot cross-lingual transfer and find mBERT competitive on each task. Additionally, we investigate the most effective strategy for utilizing mBERT in this manner, determine to what extent mBERT generalizes away from language-specific features, and measure factors that influence cross-lingual transfer.
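
The zero-shot transfer recipe described above amounts to fine-tuning mBERT on labelled data in a source language (typically English) and then evaluating it directly on a target language with no target-language training. Below is a minimal sketch of that setup, assuming the HuggingFace Transformers library and its public bert-base-multilingual-cased checkpoint; the sentences, gold labels, and label set are illustrative placeholders, not the paper's data or code.

```python
# Minimal sketch of zero-shot cross-lingual transfer with mBERT.
# Assumes HuggingFace Transformers; toy NLI-style sentence pairs only.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

MODEL_NAME = "bert-base-multilingual-cased"  # mBERT, pretrained on 104 languages
NUM_LABELS = 3  # e.g. entailment / neutral / contradiction

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_NAME, num_labels=NUM_LABELS)
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)

# 1) Fine-tune on source-language (English) labelled data only.
english_batch = tokenizer(
    ["A man is playing a guitar.", "The cat sleeps on the sofa."],
    ["Someone is making music.", "The dog is barking."],
    padding=True, truncation=True, return_tensors="pt",
)
labels = torch.tensor([0, 2])  # hypothetical gold labels for the toy batch
model.train()
loss = model(**english_batch, labels=labels).loss
loss.backward()
optimizer.step()

# 2) Evaluate directly on a target language with no target-language training
#    (zero-shot transfer); here a Spanish premise/hypothesis pair.
spanish_batch = tokenizer(
    ["Un hombre toca la guitarra."], ["Alguien hace música."],
    padding=True, truncation=True, return_tensors="pt",
)
model.eval()
with torch.no_grad():
    prediction = model(**spanish_batch).logits.argmax(dim=-1)
print(prediction)
```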


Results from the Paper


Task                Dataset         Model   Metric   Value   Global Rank
Cross-Lingual NER   CoNLL Dutch     mBERT   F1       77.57   #9
Cross-Lingual NER   CoNLL German    mBERT   F1       69.56   #9
Cross-Lingual NER   CoNLL Spanish   mBERT   F1       74.96   #8
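
The F1 values in the table are entity-level scores over BIO-tagged NER output. A hedged sketch of how such a score can be computed with the seqeval library is shown below; the tag sequences are made-up examples, and this is not necessarily the paper's exact evaluation script.

```python
# Entity-level F1 for cross-lingual NER predictions, using seqeval.
# The gold/predicted BIO sequences here are illustrative only.
from seqeval.metrics import f1_score

# Gold and predicted BIO tags for two target-language sentences.
y_true = [["B-PER", "I-PER", "O", "B-LOC"], ["O", "B-ORG", "O"]]
y_pred = [["B-PER", "I-PER", "O", "O"],     ["O", "B-ORG", "O"]]

print(f"Entity-level F1: {f1_score(y_true, y_pred):.2f}")
```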
