Search Results for author: Joanna Yoo

Found 2 papers, 0 papers with code

Scalable Training of Language Models using JAX pjit and TPUv4

no code implementations13 Apr 2022 Joanna Yoo, Kuba Perlin, Siddhartha Rao Kamalakara, João G. M. Araújo

Modern large language models require distributed training strategies due to their size.

Improving compute efficacy frontiers with SliceOut

no code implementations21 Jul 2020 Pascal Notin, Aidan N. Gomez, Joanna Yoo, Yarin Gal

Pushing forward the compute efficacy frontier in deep learning is critical for tasks that require frequent model re-training or workloads that entail training a large number of models.

Cannot find the paper you are looking for? You can Submit a new open access paper.