29 Nov 2023 • Max Milkert, David Hyde, Forrest Laine
To address this issue, we introduce a novel training strategy: we first reparameterize the network weights in a manner that forces an exponential number of activation patterns to manifest.
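
To make the idea of exponentially many activation patterns concrete, the following is a minimal sketch, not the reparameterization proposed by the authors. It assumes the standard sawtooth (tent-map) construction with fixed, hand-chosen weights: each layer has two ReLU units and computes t(x) = 2·relu(x) − 4·relu(x − 0.5), and stacking `depth` such layers yields 2^depth distinct activation patterns on (0, 1). The function names `relu` and `count_activation_patterns` are hypothetical and chosen only for illustration.

```python
import numpy as np


def relu(z):
    """Elementwise rectified linear unit."""
    return np.maximum(z, 0.0)


def count_activation_patterns(depth):
    """Count distinct on/off patterns of a stacked tent-map ReLU network.

    Assumption: every layer uses the same fixed weights (a tent map),
    which is an illustrative stand-in, not a trained or reparameterized
    network from the paper.
    """
    # Sample midpoints of a dyadic grid so that no sample ever lands
    # exactly on a breakpoint (0, 0.5, or 1) in any layer.
    n = 2 ** (depth + 2)
    h = (np.arange(n) + 0.5) / n

    bits = []
    for _ in range(depth):
        # Pre-activations of the two hidden units in this layer.
        pre = np.stack([h, h - 0.5])
        bits.append(pre > 0)
        a = relu(pre)
        # Tent map: 2x on [0, 0.5], 2 - 2x on [0.5, 1].
        h = 2.0 * a[0] - 4.0 * a[1]

    # One row per input sample, one column per hidden unit.
    patterns = np.concatenate(bits, axis=0).T
    return len({tuple(row) for row in patterns})


if __name__ == "__main__":
    for d in range(1, 6):
        print(d, count_activation_patterns(d))
```

In this sketch the pattern count doubles with each added layer while the parameter count grows only linearly, which is the kind of scaling the sentence above refers to. Randomly initialized networks generally do not realize this many patterns.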