Search Results for author: Ryan Stutsman

Found 2 papers, 1 papers with code

Packrat: Automatic Reconfiguration for Latency Minimization in CPU-based DNN Serving

no code implementations30 Nov 2023 Ankit Bhardwaj, Amar Phanishayee, Deepak Narayanan, Mihail Tarta, Ryan Stutsman

We present Packrat, a new serving system for online inference that given a model and batch size ($B$) algorithmically picks the optimal number of instances ($i$), the number of threads each should be allocated ($t$), and the batch sizes each should operate on ($b$) that minimizes latency.

BPF for storage: an exokernel-inspired approach

1 code implementation25 Feb 2021 Yu Jian Wu, Hongyi Wang, Yuhong Zhong, Asaf Cidon, Ryan Stutsman, Amy Tai, Junfeng Yang

The overhead of the kernel storage path accounts for half of the access latency for new NVMe storage devices.

Operating Systems Databases

Cannot find the paper you are looking for? You can Submit a new open access paper.