Search Results for author: Joanna Yoo

Found 2 papers, 0 papers with code

Scalable Training of Language Models using JAX pjit and TPUv4

no code implementations • 13 Apr 2022 • Joanna Yoo, Kuba Perlin, Siddhartha Rao Kamalakara, João G. M. Araújo

Modern large language models require distributed training strategies due to their size.

Paper
Add Code

Improving compute efficacy frontiers with SliceOut

no code implementations • 21 Jul 2020 • Pascal Notin, Aidan N. Gomez, Joanna Yoo, Yarin Gal

Pushing forward the compute efficacy frontier in deep learning is critical for tasks that require frequent model re-training or workloads that entail training a large number of models.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.