no code implementations • 7 Mar 2024 • Keshav Santhanam, Deepti Raghavan, Muhammad Shahir Rahman, Thejas Venkatesh, Neha Kunjal, Pratiksha Thaker, Philip Levis, Matei Zaharia
We present ALTO, a network orchestrator for efficiently serving compound AI systems such as pipelines of language models.