ICLR 2018 • Amy Nesky, Quentin Stout
We contribute to this issue by considering a modified version of the fully connected layer, which we call a block diagonal inner product layer.
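To make the idea concrete, the following is a minimal sketch of what a block diagonal inner product layer could compute, assuming the weight matrix is split into independent blocks along its diagonal and every off-diagonal entry is fixed at zero. The function names (`block_diagonal_forward`, `to_dense`, `dense_forward`) and the toy weights are hypothetical illustrations, not taken from the paper, and bias terms and training are omitted.

```python
def block_diagonal_forward(x, blocks):
    """Apply a block diagonal weight matrix to one input vector.

    x      : list of floats; its length is the sum of the block input sizes
    blocks : list of weight blocks; block k is a list of rows with shape
             (out_k, in_k) and only sees its own slice of x
    """
    y, start = [], 0
    for block in blocks:
        in_k = len(block[0])
        x_k = x[start:start + in_k]          # slice of the input owned by this block
        y.extend(sum(w * v for w, v in zip(row, x_k)) for row in block)
        start += in_k
    if start != len(x):
        raise ValueError("input length does not match total block input size")
    return y


def to_dense(blocks):
    """Expand the blocks into the equivalent dense matrix (zeros off the diagonal)."""
    total_in = sum(len(b[0]) for b in blocks)
    dense, col = [], 0
    for block in blocks:
        in_k = len(block[0])
        for row in block:
            dense.append([0.0] * col + list(row) + [0.0] * (total_in - col - in_k))
        col += in_k
    return dense


def dense_forward(x, weights):
    """Ordinary fully connected layer (no bias), used here only as a reference."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


# Hypothetical toy example: two 2x2 blocks acting on a 4-dimensional input.
blocks = [
    [[1.0, 2.0], [0.0, 1.0]],
    [[3.0, -1.0], [2.0, 2.0]],
]
x = [1.0, 2.0, 3.0, 4.0]
y = block_diagonal_forward(x, blocks)

# Stored weights: blocks only versus the full dense matrix.
n_block_params = sum(len(row) for block in blocks for row in block)
n_dense_params = sum(len(row) for row in to_dense(blocks))
```

In this toy case the blocks hold 8 weights while the equivalent dense layer holds 16, and both produce the same output. With equal-sized blocks, the stored weights and multiply-adds shrink by a factor equal to the number of blocks.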