no code implementations • 14 Jul 2020 • Jayashree Mohan, Amar Phanishayee, Ashish Raniwala, Vijay Chidambaram
We analyze nine different models across three tasks and four datasets while varying factors such as the amount of memory, number of CPU threads, storage device, GPU generation etc on servers that are a part of a large production cluster at Microsoft.