3 code implementations • 7 Mar 2019 • Beidi Chen, Tharun Medini, James Farwell, Sameh Gobriel, Charlie Tai, Anshumali Shrivastava
On the same CPU hardware, SLIDE is over 10x faster than TensorFlow (TF).
2 code implementations • 6 Mar 2021 • Shabnam Daghaghi, Nicholas Meisburger, Mengnan Zhao, Yong Wu, Sameh Gobriel, Charlie Tai, Anshumali Shrivastava
Our work highlights several novel perspectives and opportunities for implementing randomized algorithms for deep learning on modern CPUs.
no code implementations • 12 May 2023 • Gopi Krishna Jha, Anthony Thomas, Nilesh Jain, Sameh Gobriel, Tajana Rosing, Ravi Iyer
Deep learning-based recommendation systems (e.g., DLRMs) are widely used AI models to provide high-quality personalized recommendations.