no code implementations • 15 Oct 2020 • Phillip Keung, Julian Salazar, Yichao Lu, Noah A. Smith
We then improve an XLM-based unsupervised neural MT system pre-trained on Wikipedia by supplementing it with pseudo-parallel text mined from the same corpus, boosting unsupervised translation performance by up to 3. 5 BLEU on the WMT'14 French-English and WMT'16 German-English tasks and outperforming the previous state-of-the-art.
no code implementations • EMNLP 2020 • Phillip Keung, Yichao Lu, György Szarvas, Noah A. Smith
We present the Multilingual Amazon Reviews Corpus (MARC), a large-scale collection of Amazon reviews for multilingual text classification.
no code implementations • ACL 2020 • Jiawei Zhou, Phillip Keung
Non-autoregressive (NAR) neural machine translation is usually done via knowledge distillation from an autoregressive (AR) model.
no code implementations • EMNLP 2020 • Phillip Keung, Yichao Lu, Julian Salazar, Vikas Bhardwaj
Multilingual contextual embeddings have demonstrated state-of-the-art performance in zero-shot cross-lingual transfer learning, where multilingual BERT is fine-tuned on one source language and evaluated on a different target language.
1 code implementation • 12 Feb 2020 • Phillip Keung, Wei Niu, Yichao Lu, Julian Salazar, Vikas Bhardwaj
We discuss the problem of echographic transcription in autoregressive sequence-to-sequence attentional architectures for automatic speech recognition, where a model produces very long sequences of repetitive outputs when presented with out-of-domain utterances.
no code implementations • IJCNLP 2019 • Phillip Keung, Yichao Lu, Vikas Bhardwaj
We report the magnitude of the improvement on the multilingual MLDoc text classification and CoNLL 2002/2003 named entity recognition tasks.
no code implementations • WS 2018 • Yichao Lu, Phillip Keung, Faisal Ladhak, Vikas Bhardwaj, Shaonan Zhang, Jason Sun
We incorporate an explicit neural interlingua into a multilingual encoder-decoder neural machine translation (NMT) architecture.
no code implementations • 28 Mar 2017 • Yichao Lu, Phillip Keung, Shaonan Zhang, Jason Sun, Vikas Bhardwaj
We describe a prototype dialogue response generation model for the customer service domain at Amazon.