no code implementations • 31 Jan 2019 • Mayank Sharma, Aayush Yadav, Sumit Soman, Jayadeva
We show that $L_2$ regularization leads to a simpler hypothesis class and better generalization followed by DARC1 regularizer, both for shallow as well as deeper architectures.