BilBOWA: Fast Bilingual Distributed Representations without Word Alignments

9 Oct 2014 Stephan Gouws Yoshua Bengio Greg Corrado

We introduce BilBOWA (Bilingual Bag-of-Words without Alignments), a simple and computationally-efficient model for learning bilingual distributed representations of words which can scale to large monolingual datasets and does not require word-aligned parallel training data. Instead it trains directly on monolingual data and extracts a bilingual signal from a smaller set of raw-text sentence-aligned data... (read more)

PDF Abstract

Datasets


  Add Datasets introduced or used in this paper
TASK DATASET MODEL METRIC NAME METRIC VALUE GLOBAL RANK RESULT BENCHMARK
Document Classification Reuters De-En BilBOWA Accuracy 75 # 1
Document Classification Reuters En-De BilBOWA Accuracy 86.5 # 1

Methods used in the Paper


METHOD TYPE
🤖 No Methods Found Help the community by adding them if they're not listed; e.g. Deep Residual Learning for Image Recognition uses ResNet