Search Results for author: Chaojie Zhang

Found 2 papers, 1 papers with code

Towards Greener LLMs: Bringing Energy-Efficiency to the Forefront of LLM Inference

no code implementations • 29 Mar 2024 • Jovan Stojkovic, Esha Choukse, Chaojie Zhang, Inigo Goiri, Josep Torrellas

Given the high compute and memory requirements of modern LLMs, more and more top-of-the-line GPUs are being deployed to serve these models.

Paper
Add Code

POLCA: Power Oversubscription in LLM Cloud Providers

1 code implementation • 24 Aug 2023 • Pratyush Patel, Esha Choukse, Chaojie Zhang, Íñigo Goiri, Brijesh Warrier, Nithish Mahalingam, Ricardo Bianchini

We propose POLCA, our framework for power oversubscription that is robust, reliable, and readily deployable for GPU clusters.

6,574

Paper
Code

Cannot find the paper you are looking for? You can Submit a new open access paper.