Browse > Natural Language Processing > Machine Translation > Multimodal Machine Translation

Multimodal Machine Translation

7 papers with code · Natural Language Processing
Subtask of Machine Translation

Multimodal machine translation is the task of doing machine translation with multiple data sources - for example, translating "a bird is flying over water" + an image of a bird over water to German text.

State-of-the-art leaderboards

No evaluation results yet. Help compare methods by submit evaluation metrics.

Greatest papers with code

NMTPY: A Flexible Toolkit for Advanced Neural Machine Translation Systems

1 Jun 2017lium-lst/nmtpy

nmtpy has been used for LIUM's top-ranked submissions to WMT Multimodal Machine Translation and News Translation tasks in 2016 and 2017.

MULTIMODAL MACHINE TRANSLATION

Does Multimodality Help Human and Machine for Translation and Image Captioning?

WS 2016 lium-lst/nmtpy

This paper presents the systems developed by LIUM and CVC for the WMT16 Multimodal Machine Translation challenge.

IMAGE CAPTIONING MULTIMODAL MACHINE TRANSLATION

Multimodal Machine Translation with Embedding Prediction

NAACL 2019 toshohirasawa/nmtpytorch-emb-pred

Multimodal machine translation is an attractive application of neural machine translation (NMT).

MULTIMODAL MACHINE TRANSLATION WORD EMBEDDINGS

A Visual Attention Grounding Neural Model for Multimodal Machine Translation

EMNLP 2018 sampalomad/IKEA-Dataset

The model leverages a visual attention grounding mechanism that links the visual semantics with the corresponding textual semantics.

MULTIMODAL MACHINE TRANSLATION

UMONS Submission for WMT18 Multimodal Translation Task

15 Oct 2018jbdel/WMT18_MNMT

This paper describes the UMONS solution for the Multimodal Machine Translation Task presented at the third conference on machine translation (WMT18).

IMAGE CAPTIONING MULTIMODAL MACHINE TRANSLATION