TASK	DATASET	MODEL	METRIC NAME	METRIC VALUE	GLOBAL RANK
Bangla Spelling Error Correction	DPCSpell-Bangla-SEC-Corpus	GRUSeq2Seq	Exact Match Accuracy	75.56%	# 4
Machine Translation	IWSLT2015 German-English	Bi-GRU (MLE+SLE)	BLEU score	28.53	# 11
Dialogue Generation	Persona-Chat	Seq2Seq + Attention	Avg F1	16.18	# 4
Machine Translation	WMT2014 English-French	RNN-search50*	BLEU score	36.2	# 43

Badge	Markdown
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/neural-machine-translation-by-jointly/bangla-spelling-error-correction-on-dpcspell)](https://paperswithcode.com/sota/bangla-spelling-error-correction-on-dpcspell?p=neural-machine-translation-by-jointly)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/neural-machine-translation-by-jointly/dialogue-generation-on-persona-chat-1)](https://paperswithcode.com/sota/dialogue-generation-on-persona-chat-1?p=neural-machine-translation-by-jointly)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/neural-machine-translation-by-jointly/machine-translation-on-iwslt2015-german)](https://paperswithcode.com/sota/machine-translation-on-iwslt2015-german?p=neural-machine-translation-by-jointly)`
	`[![PWC](https://img.shields.io/endpoint.svg?url=https://paperswithcode.com/badge/neural-machine-translation-by-jointly/machine-translation-on-wmt2014-english-french)](https://paperswithcode.com/sota/machine-translation-on-wmt2014-english-french?p=neural-machine-translation-by-jointly)`

Neural Machine Translation by Jointly Learning to Align and Translate

1 Sep 2014 · Dzmitry Bahdanau, Kyunghyun Cho, Yoshua Bengio ·

Neural machine translation is a recently proposed approach to machine translation. Unlike the traditional statistical machine translation, the neural machine translation aims at building a single neural network that can be jointly tuned to maximize the translation performance. The models proposed recently for neural machine translation often belong to a family of encoder-decoders and consists of an encoder that encodes a source sentence into a fixed-length vector from which a decoder generates a translation. In this paper, we conjecture that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder-decoder architecture, and propose to extend this by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly. With this new approach, we achieve a translation performance comparable to the existing state-of-the-art phrase-based system on the task of English-to-French translation. Furthermore, qualitative analysis reveals that the (soft-)alignments found by the model agree well with our intuition.

PDF Abstract

Code

Add Remove Mark official

graykode/nlp-tutorial

↳ Quickstart in

Colab

13,691

brightmart/text_classification

7,742

bentrevett/pytorch-seq2seq

↳ Quickstart in

Colab

5,162

farizrahman4u/seq2seq

3,173

philipperemy/keras-attention-mechan…

2,798

See all 121 implementations

Tasks

Add Remove

Bangla Spelling Error Correction

Dialogue Generation

Machine Translation

Sentence

Translation

Datasets

WMT 2014

PERSONA-CHAT DPCSpell-Bangla-SEC-Corpus

Results from the Paper

Edit

Ranked #4 on Dialogue Generation on Persona-Chat (using extra training data)

Get a GitHub badge

Task	Dataset	Model	Metric Name	Metric Value	Global Rank	Benchmark
Bangla Spelling Error Correction	DPCSpell-Bangla-SEC-Corpus	GRUSeq2Seq	Exact Match Accuracy	75.56%	# 4	Compare
Machine Translation	IWSLT2015 German-English	Bi-GRU (MLE+SLE)	BLEU score	28.53	# 11	Compare
Dialogue Generation	Persona-Chat	Seq2Seq + Attention	Avg F1	16.18	# 4	Compare
Machine Translation	WMT2014 English-French	RNN-search50*	BLEU score	36.2	# 43	Compare

Methods

Add Remove

Additive Attention • Tanh Activation

Edit Social Preview

Neural Machine Translation by Jointly Learning to Align and Translate

Code Edit Add Remove Mark official

Tasks Edit Add Remove

Datasets Edit

Results from the Paper Edit

Methods Edit Add Remove

Code

Add Remove Mark official

Tasks

Add Remove

Datasets

Results from the Paper

Edit

Methods

Add Remove