Search Results for author: Baolin Li

Found 10 papers, 0 papers with code

Toward Sustainable GenAI using Generation Directives for Carbon-Friendly Large Language Model Inference

no code implementations • 19 Mar 2024 • Baolin Li, Yankai Jiang, Vijay Gadepally, Devesh Tiwari

The rapid advancement of Generative Artificial Intelligence (GenAI) across diverse sectors raises significant environmental concerns, notably the carbon emissions from their cloud and high performance computing (HPC) infrastructure.

Language Modelling Large Language Model

Paper
Add Code

Sustainable Supercomputing for AI: GPU Power Capping at HPC Scale

no code implementations • 25 Feb 2024 • Dan Zhao, Siddharth Samsi, Joseph McDonald, Baolin Li, David Bestor, Michael Jones, Devesh Tiwari, Vijay Gadepally

In this paper, we study the aggregate effect of power-capping GPUs on GPU temperature and power draw at a research supercomputing center.

Paper
Add Code

Synergistic Signals: Exploiting Co-Engagement and Semantic Links via Graph Neural Networks

no code implementations • 7 Dec 2023 • Zijie Huang, Baolin Li, Hafez Asgharzadeh, Anne Cocos, Lingyi Liu, Evan Cox, Colby Wise, Sudarshan Lamkhede

Given a set of candidate entities (e. g. movie titles), the ability to identify similar entities is a core capability of many recommender systems.

Collaborative Filtering Recommendation Systems +1

Paper
Add Code

From Words to Watts: Benchmarking the Energy Costs of Large Language Model Inference

no code implementations • 4 Oct 2023 • Siddharth Samsi, Dan Zhao, Joseph McDonald, Baolin Li, Adam Michaleas, Michael Jones, William Bergeron, Jeremy Kepner, Devesh Tiwari, Vijay Gadepally

Large language models (LLMs) have exploded in popularity due to their new generative capabilities that go far beyond prior state-of-the-art.

Benchmarking GSM8K +2

Paper
Add Code

KAIROS: Building Cost-Efficient Machine Learning Inference Systems with Heterogeneous Cloud Resources

no code implementations • 12 Oct 2022 • Baolin Li, Siddharth Samsi, Vijay Gadepally, Devesh Tiwari

Online inference is becoming a key service product for many businesses, deployed in cloud platforms to meet customer demands.

Paper
Add Code

RIBBON: Cost-Effective and QoS-Aware Deep Learning Model Inference using a Diverse Pool of Cloud Computing Instances

no code implementations • 23 Jul 2022 • Baolin Li, Rohan Basu Roy, Tirthak Patel, Vijay Gadepally, Karen Gettings, Devesh Tiwari

Deep learning model inference is a key service in many businesses and scientific discovery processes.

Bayesian Optimization Cloud Computing +2

Paper
Add Code

Great Power, Great Responsibility: Recommendations for Reducing Energy for Training Language Models

no code implementations • Findings (NAACL) 2022 • Joseph McDonald, Baolin Li, Nathan Frey, Devesh Tiwari, Vijay Gadepally, Siddharth Samsi

In particular, we focus on techniques to measure energy usage and different hardware and datacenter-oriented settings that can be tuned to reduce energy consumption for training and inference for language models.

Cloud Computing Language Modelling

Paper
Add Code

The MIT Supercloud Workload Classification Challenge

no code implementations • 12 Apr 2022 • Benny J. Tang, Qiqi Chen, Matthew L. Weiss, Nathan Frey, Joseph McDonald, David Bestor, Charles Yee, William Arcand, Chansup Byun, Daniel Edelman, Matthew Hubbell, Michael Jones, Jeremy Kepner, Anna Klein, Adam Michaleas, Peter Michaleas, Lauren Milechin, Julia Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Andrew Bowne, Lindsey McEvoy, Baolin Li, Devesh Tiwari, Vijay Gadepally, Siddharth Samsi

We introduce a labelled dataset that can be used to develop new approaches to workload classification and present initial results based on existing approaches.

Classification

Paper
Add Code

Benchmarking Resource Usage for Efficient Distributed Deep Learning

no code implementations • 28 Jan 2022 • Nathan C. Frey, Baolin Li, Joseph McDonald, Dan Zhao, Michael Jones, David Bestor, Devesh Tiwari, Vijay Gadepally, Siddharth Samsi

Deep learning (DL) workflows demand an ever-increasing budget of compute and energy in order to achieve outsized gains.

Benchmarking

Paper
Add Code

The MIT Supercloud Dataset

no code implementations • 4 Aug 2021 • Siddharth Samsi, Matthew L Weiss, David Bestor, Baolin Li, Michael Jones, Albert Reuther, Daniel Edelman, William Arcand, Chansup Byun, John Holodnack, Matthew Hubbell, Jeremy Kepner, Anna Klein, Joseph McDonald, Adam Michaleas, Peter Michaleas, Lauren Milechin, Julia Mullen, Charles Yee, Benjamin Price, Andrew Prout, Antonio Rosa, Allan Vanterpool, Lindsey McEvoy, Anson Cheng, Devesh Tiwari, Vijay Gadepally

In this paper we introduce the MIT Supercloud Dataset which aims to foster innovative AI/ML approaches to the analysis of large scale HPC and datacenter/cloud operations.

Scheduling

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.