Search Results for author: Stephan Gouws

Found 10 papers, 5 papers with code

Learning from Samples of Variable Quality

no code implementations · ICLR Workshop LLD 2019 · Mostafa Dehghani, Arash Mehrjou, Stephan Gouws, Jaap Kamps, Bernhard Schölkopf

Training labels are expensive to obtain and may be of varying quality, as some may be from trusted expert labelers while others might be from heuristics or other sources of weak supervision such as crowd-sourcing.

Universal Transformers

8 code implementations · ICLR 2019 · Mostafa Dehghani, Stephan Gouws, Oriol Vinyals, Jakob Uszkoreit, Łukasz Kaiser

Feed-forward and convolutional architectures have recently been shown to achieve superior results on some sequence modeling tasks such as machine translation, with the added advantage that they concurrently process all inputs in the sequence, leading to easy parallelization and faster training times.

Tasks: Inductive Bias, LAMBADA, +4
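The core idea of the Universal Transformer is to apply the *same* self-attention block recurrently over depth, adding a timestep signal at each step, rather than stacking distinct layers. Below is a minimal NumPy sketch of that recurrence; the single-head attention, the `tanh` transition, and the simplified timestep signal are illustrative assumptions, not the paper's exact parameterization (which uses multi-head attention and adaptive computation time):

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(h, w_qkv):
    # Single-head scaled dot-product self-attention.
    # h: (seq_len, d), w_qkv: (d, 3*d)
    d = h.shape[-1]
    q, k, v = np.split(h @ w_qkv, 3, axis=-1)
    return softmax(q @ k.T / np.sqrt(d)) @ v

def universal_transformer_encode(x, w_qkv, w_ff, num_steps=4):
    """Recurrently apply ONE shared block over depth (the UT idea):
    timestep signal -> shared attention -> shared transition, with residuals."""
    seq_len, d = x.shape
    h = x
    for t in range(num_steps):
        # Hypothetical simplified timestep signal (stand-in for the
        # paper's coordinate embeddings).
        h = h + np.sin(t + np.arange(d) / d)
        h = h + self_attention(h, w_qkv)  # same attention weights every step
        h = h + np.tanh(h @ w_ff)         # same transition weights every step
    return h

rng = np.random.default_rng(0)
d = 8
x = rng.normal(size=(5, d))
out = universal_transformer_encode(
    x, rng.normal(size=(d, 3 * d)) * 0.1, rng.normal(size=(d, d)) * 0.1
)
```

Because the weights are shared across steps, `num_steps` can be varied at inference time without changing the parameter count, which is what gives the model its recurrent, depth-adaptive flavor.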

Tensor2Tensor for Neural Machine Translation

14 code implementations · WS 2018 · Ashish Vaswani, Samy Bengio, Eugene Brevdo, Francois Chollet, Aidan N. Gomez, Stephan Gouws, Llion Jones, Łukasz Kaiser, Nal Kalchbrenner, Niki Parmar, Ryan Sepassi, Noam Shazeer, Jakob Uszkoreit

Tensor2Tensor is a library for deep learning models that is well-suited for neural machine translation and includes the reference implementation of the state-of-the-art Transformer model.

Tasks: Machine Translation, Translation

XGAN: Unsupervised Image-to-Image Translation for Many-to-Many Mappings

4 code implementations · ICLR 2018 · Amélie Royer, Konstantinos Bousmalis, Stephan Gouws, Fred Bertsch, Inbar Mosseri, Forrester Cole, Kevin Murphy

Style transfer usually refers to the task of applying color and texture information from a specific style image to a given content image while preserving the structure of the latter.

Tasks: Domain Adaptation, Style Transfer, +2

Fidelity-Weighted Learning

no code implementations · ICLR 2018 · Mostafa Dehghani, Arash Mehrjou, Stephan Gouws, Jaap Kamps, Bernhard Schölkopf

We propose "fidelity-weighted learning" (FWL), a semi-supervised student-teacher approach for training deep neural networks using weakly-labeled data.

Tasks: Ad-Hoc Information Retrieval, Information Retrieval, +1

Generating High-Quality and Informative Conversation Responses with Sequence-to-Sequence Models

no code implementations · EMNLP 2017 · Louis Shao, Stephan Gouws, Denny Britz, Anna Goldie, Brian Strope, Ray Kurzweil

Sequence-to-sequence models have been applied to the conversation response generation problem where the source sequence is the conversation history and the target sequence is the response.

Tasks: Response Generation, Translation

BilBOWA: Fast Bilingual Distributed Representations without Word Alignments

2 code implementations · 9 Oct 2014 · Stephan Gouws, Yoshua Bengio, Greg Corrado

We introduce BilBOWA (Bilingual Bag-of-Words without Alignments), a simple and computationally-efficient model for learning bilingual distributed representations of words which can scale to large monolingual datasets and does not require word-aligned parallel training data.

Tasks: Cross-Lingual Document Classification, Document Classification, +3
