A Hardware-Software Blueprint for Flexible Deep Learning Specialization

11 Jul 2018Thierry MoreauTianqi ChenLuis VegaJared RoeschEddie YanLianmin ZhengJosh FrommZiheng JiangLuis CezeCarlos GuestrinArvind Krishnamurthy

Specialized Deep Learning (DL) acceleration stacks, designed for a specific set of frameworks, model architectures, operators, and data types, offer the allure of high performance while sacrificing flexibility. Changes in algorithms, models, operators, or numerical systems threaten the viability of specialized hardware accelerators... (read more)

PDF Abstract

Results from the Paper


  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.

Methods used in the Paper


METHOD TYPE
🤖 No Methods Found Help the community by adding them if they're not listed; e.g. Deep Residual Learning for Image Recognition uses ResNet