TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK	REMOVE
Cross-Lingual Document Classification	Reuters RCV1/RCV2 English-to-German	Bi+	Accuracy	88.1	# 2
Cross-Lingual Document Classification	Reuters RCV1/RCV2 German-to-English	Bi+	Accuracy	79.2	# 2

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multilingual-models-for-compositional/cross-lingual-document-classification-on-12)](https://paperswithcode.com/sota/cross-lingual-document-classification-on-12?p=multilingual-models-for-compositional)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/multilingual-models-for-compositional/cross-lingual-document-classification-on-13)](https://paperswithcode.com/sota/cross-lingual-document-classification-on-13?p=multilingual-models-for-compositional)`

Multilingual Models for Compositional Distributed Semantics

ACL 2014 · Karl Moritz Hermann, Phil Blunsom ·

We present a novel technique for learning semantic representations, which extends the distributional hypothesis to multilingual data and joint-space embeddings. Our models leverage parallel data and learn to strongly align the embeddings of semantically equivalent sentences, while maintaining sufficient distance between those of dissimilar sentences. The models do not rely on word alignments or any syntactic information and are successfully applied to a number of diverse languages. We extend our approach to learn semantic representations at the document level, too. We evaluate these models on two cross-lingual document classification tasks, outperforming the prior state of the art. Through qualitative analysis and the study of pivoting effects we demonstrate that our representations are semantically plausible and can capture semantic relationships across languages without parallel data.

PDF Abstract ACL 2014 PDF ACL 2014 Abstract

Code

Add Remove Mark official

karlmoritz/bicvm

Tasks

Add Remove

Cross-Lingual Document Classification

Document Classification

General Classification

Learning Semantic Representations

Datasets

RCV1

Results from the Paper

Edit

Ranked #2 on Cross-Lingual Document Classification on Reuters RCV1/RCV2 English-to-German

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Result	Benchmark
Cross-Lingual Document Classification	Reuters RCV1/RCV2 English-to-German	Bi+	Accuracy	88.1	# 2		Compare
Cross-Lingual Document Classification	Reuters RCV1/RCV2 German-to-English	Bi+	Accuracy	79.2	# 2		Compare

Methods

Add Remove

No methods listed for this paper. Add relevant methods here

Edit Social Preview

Multilingual Models for Compositional Distributed Semantics

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove