no code implementations • ICLR 2019 • Haihao Shen, Jiong Gong, Xiaoli Liu, Guoming Zhang, Ge Jin, and Eric Lin
High throughput and low latency inference of deep neural networks are critical for the deployment of deep learning applications.
1 code implementation • 4 May 2018 • Jiong Gong, Haihao Shen, Guoming Zhang, Xiaoli Liu, Shane Li, Ge Jin, Niharika Maheshwari, Evarist Fomenko, Eden Segal
High throughput and low latency inference of deep neural networks are critical for the deployment of deep learning applications.