2 code implementations • 6 Mar 2021 • Shabnam Daghaghi, Nicholas Meisburger, Mengnan Zhao, Yong Wu, Sameh Gobriel, Charlie Tai, Anshumali Shrivastava
Our work highlights several novel perspectives and opportunities for implementing randomized algorithms for deep learning on modern CPUs.
3 code implementations • 7 Mar 2019 • Beidi Chen, Tharun Medini, James Farwell, Sameh Gobriel, Charlie Tai, Anshumali Shrivastava
On the same CPU hardware, SLIDE is over 10x faster than TF.