Search Results for author: Michael Lui

Found 1 papers, 0 papers with code

Understanding Capacity-Driven Scale-Out Neural Recommendation Inference

no code implementations4 Nov 2020 Michael Lui, Yavuz Yetim, Özgür Özkan, Zhuoran Zhao, Shin-Yeh Tsai, Carole-Jean Wu, Mark Hempstead

One approach to support this scale is with distributed serving, or distributed inference, which divides the memory requirements of a single large model across multiple servers.

Recommendation Systems

Cannot find the paper you are looking for? You can Submit a new open access paper.