Search Results for author: Andy Davis

Found 5 papers, 3 papers with code

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer

3 code implementations23 Jan 2017 Noam Shazeer, Azalia Mirhoseini, Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean

In this work, we address these challenges and finally realize the promise of conditional computation, achieving greater than 1000x improvements in model capacity with only minor losses in computational efficiency on modern GPU clusters.

Language Modelling Machine Translation +1

Cannot find the paper you are looking for? You can Submit a new open access paper.