Search Results for author: Marco Zeller

Found 1 papers, 0 papers with code

Chameleon: a heterogeneous and disaggregated accelerator system for retrieval-augmented language models

no code implementations • 15 Oct 2023 • Wenqi Jiang, Marco Zeller, Roger Waleffe, Torsten Hoefler, Gustavo Alonso

The heterogeneity ensures efficient acceleration of both LM inference and retrieval, while the accelerator disaggregation enables the system to independently scale both types of accelerators to fulfill diverse RALM requirements.

Language Modelling Retrieval +1

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.