no code implementations • 25 Mar 2024 • Guoliang He, Eiko Yoneki
In this work, we explore the possibility of GPU native instruction optimization to further push the CUDA kernels to extreme performance.
1 code implementation • 28 Apr 2023 • Guoliang He, Sean Parker, Eiko Yoneki
Tensor graph superoptimisation systems perform a sequence of subgraph substitution to neural networks, to find the optimal computation graph structure.
1 code implementation • 8 Mar 2023 • Guoliang He, Zak Singh, Eiko Yoneki
Rewrite systems [6, 10, 12] have been widely employing equality saturation [9], which is an optimisation methodology that uses a saturated e-graph to represent all possible sequences of rewrite simultaneously, and then extracts the optimal one.