Balancing Efficiency and Flexibility for DNN Acceleration via Temporal GPU-Systolic Array Integration

18 Feb 2020Cong GuoYangjie ZhouJingwen LengYuhao ZhuZidong DuQuan ChenChao LiMinyi GuoBin Yao

The research interest in specialized hardware accelerators for deep neural networks (DNN) spiked recently owing to their superior performance and efficiency. However, today's DNN accelerators primarily focus on accelerating specific "kernels" such as convolution and matrix multiplication, which are vital but only part of an end-to-end DNN-enabled application... (read more)

PDF Abstract


No code implementations yet. Submit your code now


Results from the Paper

  Submit results from this paper to get state-of-the-art GitHub badges and help the community compare results to other papers.