no code implementations • 8 Sep 2017 • Andrew Anderson, Aravind Vasudevan, Cormac Keane, David Gregg
We present two novel GEMM-based algorithms that require just $O(MHW)$ and $O(KW)$ additional space respectively, where $M$ is the number of channels in the result of the convolution.
no code implementations • 6 Apr 2017 • Aravind Vasudevan, Andrew Anderson, David Gregg
A common approach to implementing convolutional layers is to expand the image into a column matrix (im2col) and perform Multiple Channel Multiple Kernel (MCMK) convolution using an existing parallel General Matrix Multiplication (GEMM) library.