29 Mar 2024 • Jovan Stojkovic, Esha Choukse, Chaojie Zhang, Inigo Goiri, Josep Torrellas
Given the high compute and memory requirements of modern LLMs, ever more top-of-the-line GPUs are being deployed to serve these models.