Search Results for author: Ken Birman

Found 3 papers, 1 paper with code

Compass: A Decentralized Scheduler for Latency-Sensitive ML Workflows

no code implementations • 27 Feb 2024 • Yuting Yang, Andrea Merlina, Weijia Song, Tiancheng Yuan, Ken Birman, Roman Vitenberg

We consider ML query processing in distributed systems where GPU-enabled workers coordinate to execute complex queries: a computing style often seen in applications that interact with users in support of image processing and natural language processing.

Management

Low-Latency ML Inference by Grouping Correlated Data Objects and Computation

no code implementations • 30 Nov 2023 • Thiago Garrett, Weijia Song, Roman Vitenberg, Ken Birman

ML inference workflows often require low latency and high throughput, yet we lack good options for addressing this need.

Management, Scheduling

Cascade: A Platform for Delay-Sensitive Edge Intelligence

1 code implementation • 29 Nov 2023 • Weijia Song, Thiago Garrett, Yuting Yang, Mingzhao Liu, Edward Tremel, Lorenzo Rosa, Andrea Merlina, Roman Vitenberg, Ken Birman

Interactive intelligent computing applications are increasingly prevalent, creating a need for AI/ML platforms optimized to reduce per-event latency while maintaining high throughput and efficient resource management.

Management
