1 code implementation • 9 Mar 2024 • Jie Liu, Zhongyuan Zhao, Zijian Ding, Benjamin Brock, Hongbo Rong, Zhiru Zhang
The ongoing trend of hardware specialization has led to a growing use of custom data formats when processing sparse workloads, which are typically memory-bound.
no code implementations • 29 Oct 2020 • Hongbo Rong, Xiaochen Hao, Yun Liang, Lidong Xu, Hong H Jiang, Pradeep Dubey
We propose a language and compiler to productively build high-performance {\it software systolic arrays} that run on GPUs.