no code implementations • 16 Jan 2024 • Prasoon Sinha, Kostis Kaffes, Neeraja J. Yadwadkar
However, today's serverless systems lack performance guarantees for function invocations, thus limiting support for performance-critical applications: we observed severe performance variability (up to 6x).
1 code implementation • 15 Jan 2024 • Dan Jacobellis, Daniel Cummings, Neeraja J. Yadwadkar
Our results indicate three key findings: (1) using generative compression, it is feasible to leverage highly compressed data while incurring a negligible impact on machine perceptual quality; (2) machine perceptual quality correlates strongly with deep similarity metrics, indicating a crucial role of these metrics in the development of machine-oriented codecs; and (3) using lossy compressed datasets, (e. g. ImageNet) for pre-training can lead to counter-intuitive scenarios where lossy compression increases machine perceptual quality rather than degrading it.
no code implementations • 27 Oct 2023 • Bodun Hu, Le Xu, Jeongyoon Moon, Neeraja J. Yadwadkar, Aditya Akella
Rapid advancements over the years have helped machine learning models reach previously hard-to-achieve goals, sometimes even exceeding human capabilities.
1 code implementation • 30 May 2019 • Francisco Romero, Qian Li, Neeraja J. Yadwadkar, Christos Kozyrakis
This paper introduces INFaaS, a managed and model-less system for distributed inference serving, where developers simply specify the performance and accuracy requirements for their applications without needing to specify a specific model-variant for each query.