no code implementations • 19 Mar 2024 • Baolin Li, Yankai Jiang, Vijay Gadepally, Devesh Tiwari
The rapid advancement of Generative Artificial Intelligence (GenAI) across diverse sectors raises significant environmental concerns, notably the carbon emissions from their cloud and high performance computing (HPC) infrastructure.
no code implementations • 25 Feb 2024 • Dan Zhao, Siddharth Samsi, Joseph McDonald, Baolin Li, David Bestor, Michael Jones, Devesh Tiwari, Vijay Gadepally
In this paper, we study the aggregate effect of power-capping GPUs on GPU temperature and power draw at a research supercomputing center.
no code implementations • 7 Dec 2023 • Zijie Huang, Baolin Li, Hafez Asgharzadeh, Anne Cocos, Lingyi Liu, Evan Cox, Colby Wise, Sudarshan Lamkhede
Given a set of candidate entities (e. g. movie titles), the ability to identify similar entities is a core capability of many recommender systems.
no code implementations • 4 Oct 2023 • Siddharth Samsi, Dan Zhao, Joseph McDonald, Baolin Li, Adam Michaleas, Michael Jones, William Bergeron, Jeremy Kepner, Devesh Tiwari, Vijay Gadepally
Large language models (LLMs) have exploded in popularity due to their new generative capabilities that go far beyond prior state-of-the-art.
no code implementations • 12 Oct 2022 • Baolin Li, Siddharth Samsi, Vijay Gadepally, Devesh Tiwari
Online inference is becoming a key service product for many businesses, deployed in cloud platforms to meet customer demands.
no code implementations • 23 Jul 2022 • Baolin Li, Rohan Basu Roy, Tirthak Patel, Vijay Gadepally, Karen Gettings, Devesh Tiwari
Deep learning model inference is a key service in many businesses and scientific discovery processes.
no code implementations • Findings (NAACL) 2022 • Joseph McDonald, Baolin Li, Nathan Frey, Devesh Tiwari, Vijay Gadepally, Siddharth Samsi
In particular, we focus on techniques to measure energy usage and different hardware and datacenter-oriented settings that can be tuned to reduce energy consumption for training and inference for language models.
no code implementations • 12 Apr 2022 • Benny J. Tang, Qiqi Chen, Matthew L. Weiss, Nathan Frey, Joseph McDonald, David Bestor, Charles Yee, William Arcand, Chansup Byun, Daniel Edelman, Matthew Hubbell, Michael Jones, Jeremy Kepner, Anna Klein, Adam Michaleas, Peter Michaleas, Lauren Milechin, Julia Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Andrew Bowne, Lindsey McEvoy, Baolin Li, Devesh Tiwari, Vijay Gadepally, Siddharth Samsi
We introduce a labelled dataset that can be used to develop new approaches to workload classification and present initial results based on existing approaches.
no code implementations • 28 Jan 2022 • Nathan C. Frey, Baolin Li, Joseph McDonald, Dan Zhao, Michael Jones, David Bestor, Devesh Tiwari, Vijay Gadepally, Siddharth Samsi
Deep learning (DL) workflows demand an ever-increasing budget of compute and energy in order to achieve outsized gains.
no code implementations • 4 Aug 2021 • Siddharth Samsi, Matthew L Weiss, David Bestor, Baolin Li, Michael Jones, Albert Reuther, Daniel Edelman, William Arcand, Chansup Byun, John Holodnack, Matthew Hubbell, Jeremy Kepner, Anna Klein, Joseph McDonald, Adam Michaleas, Peter Michaleas, Lauren Milechin, Julia Mullen, Charles Yee, Benjamin Price, Andrew Prout, Antonio Rosa, Allan Vanterpool, Lindsey McEvoy, Anson Cheng, Devesh Tiwari, Vijay Gadepally
In this paper we introduce the MIT Supercloud Dataset which aims to foster innovative AI/ML approaches to the analysis of large scale HPC and datacenter/cloud operations.