Search Results for author: Zachary DeVito

Found 8 papers, 2 papers with code

Generative AI Beyond LLMs: System Implications of Multi-Modal Generation

no code implementations • 22 Dec 2023 • Alicia Golden, Samuel Hsia, Fei Sun, Bilge Acun, Basil Hosmer, Yejin Lee, Zachary DeVito, Jeff Johnson, Gu-Yeon Wei, David Brooks, Carole-Jean Wu

As the development of large-scale Generative AI models evolve beyond text (1D) generation to include image (2D) and video (3D) generation, processing spatial and temporal information presents unique challenges to quality, performance, and efficiency.

3D Generation

Paper
Add Code

MAD Max Beyond Single-Node: Enabling Large Machine Learning Model Acceleration on Distributed Systems

no code implementations • 4 Oct 2023 • Samuel Hsia, Alicia Golden, Bilge Acun, Newsha Ardalani, Zachary DeVito, Gu-Yeon Wei, David Brooks, Carole-Jean Wu

Training and deploying large machine learning (ML) models is time-consuming and requires significant distributed computing infrastructures.

Distributed Computing

Paper
Add Code

A Theory on Adam Instability in Large-Scale Machine Learning

no code implementations • 19 Apr 2023 • Igor Molybog, Peter Albert, Moya Chen, Zachary DeVito, David Esiobu, Naman Goyal, Punit Singh Koura, Sharan Narang, Andrew Poulton, Ruan Silva, Binh Tang, Diana Liskovich, Puxin Xu, Yuchen Zhang, Melanie Kambadur, Stephen Roller, Susan Zhang

We present a theory for the previously unexplained divergent behavior noticed in the training of large language models.

Language Modelling

Paper
Add Code

Torch.fx: Practical Program Capture and Transformation for Deep Learning in Python

no code implementations • 15 Dec 2021 • James K. Reed, Zachary DeVito, Horace He, Ansley Ussery, Jason Ansel

Modern deep learning frameworks provide imperative, eager execution programming interfaces embedded in Python to provide a productive development experience.

Paper
Add Code

Using Python for Model Inference in Deep Learning

no code implementations • 1 Apr 2021 • Zachary DeVito, Jason Ansel, Will Constable, Michael Suo, Ailing Zhang, Kim Hazelwood

We evaluate our design on a suite of popular PyTorch models on Github, showing how they can be packaged in our inference format, and comparing their performance to TorchScript.

Model extraction

Paper
Add Code

Tensor Comprehensions: Framework-Agnostic High-Performance Machine Learning Abstractions

4 code implementations • 13 Feb 2018 • Nicolas Vasilache, Oleksandr Zinenko, Theodoros Theodoridis, Priya Goyal, Zachary DeVito, William S. Moses, Sven Verdoolaege, Andrew Adams, Albert Cohen

Deep learning models with convolutional and recurrent networks are now ubiquitous and analyze massive amounts of audio, image, video, text and graph data, with applications in automatic translation, speech-to-text, scene understanding, ranking user preferences, ad placement, etc.

BIG-bench Machine Learning Management +2

1,754

Paper
Code

Automatic Differentiation in PyTorch

1 code implementation • NIPS 2017 2017 • Adam Paszke, Sam Gross, Soumith Chintala, Gregory Chanan, Edward Yang, Zachary DeVito, Zeming Lin, Alban Desmaison, Luca Antiga, Adam Lerer

In this article, we describe an automatic differentiation module of PyTorch — a library designed to enable rapid research on machine learning models.

Clustering Dimensionality Reduction +1

77,871

Paper
Code

Opt: A Domain Specific Language for Non-linear Least Squares Optimization in Graphics and Imaging

no code implementations • 22 Apr 2016 • Zachary DeVito, Michael Mara, Michael Zollhöfer, Gilbert Bernstein, Jonathan Ragan-Kelley, Christian Theobalt, Pat Hanrahan, Matthew Fisher, Matthias Nießner

Many graphics and vision problems can be expressed as non-linear least squares optimizations of objective functions over visual data, such as images and meshes.

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.