Cold Fusion: Training Seq2Seq Models Together with Language Models

ICLR 2018 Anuroop SriramHeewoo JunSanjeev SatheeshAdam Coates

Sequence-to-sequence (Seq2Seq) models with attention have excelled at tasks which involve generating natural language sentences such as machine translation, image captioning and speech recognition. Performance has further been improved by leveraging unlabeled data, often in the form of a language model... (read more)

PDF Abstract ICLR 2018 PDF ICLR 2018 Abstract

Results from the Paper


  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods used in the Paper