no code implementations • 22 Feb 2024 • Miaoxin Wang, Xiao Wu, Jun Lin, Zhongfeng Wang
Particularly, it demonstrates efficient support for large-kernel CNNs, achieving throughputs of 169. 68 GOPS and 244. 55 GOPS for RepLKNet-31 and PyConvResNet-50, respectively, both of which are implemented on hardware for the first time.