1 code implementation • ICLR 2022 • Sahil Singla, Surbhi Singla, Soheil Feizi
While $1$-Lipschitz CNNs can be designed by enforcing a $1$-Lipschitz constraint on each layer, training such networks requires each layer to have an orthogonal Jacobian matrix (for all inputs) to prevent the gradients from vanishing during backpropagation.
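The role of the orthogonal Jacobian can be illustrated with a small numerical sketch. This is not the authors' method: it assumes purely linear layers, so each layer's Jacobian is just its weight matrix, and the names `random_orthogonal` and `backprop_norm` are hypothetical helpers introduced only for this illustration. Both stacks below are $1$-Lipschitz, but only the orthogonal one preserves the gradient norm during backpropagation.

```python
import numpy as np

def random_orthogonal(n, rng):
    """Return an n x n orthogonal matrix (Q factor of a random Gaussian matrix)."""
    q, _ = np.linalg.qr(rng.standard_normal((n, n)))
    return q

def backprop_norm(jacobians, grad_out):
    """Push grad_out backward through the layer Jacobians and return its final norm."""
    g = grad_out
    for J in reversed(jacobians):
        g = J.T @ g  # chain rule for one layer: multiply by the transposed Jacobian
    return float(np.linalg.norm(g))

rng = np.random.default_rng(0)
n, depth = 16, 20  # illustrative sizes, chosen arbitrarily
grad_out = rng.standard_normal(n)

# Orthogonal Jacobians: every singular value equals 1.
orth_layers = [random_orthogonal(n, rng) for _ in range(depth)]
# Still 1-Lipschitz (every singular value equals 0.9), but not orthogonal.
contractive_layers = [0.9 * J for J in orth_layers]

norm_start = float(np.linalg.norm(grad_out))
norm_orth = backprop_norm(orth_layers, grad_out)
norm_contractive = backprop_norm(contractive_layers, grad_out)

print(f"initial gradient norm:       {norm_start:.4f}")
print(f"after {depth} orthogonal layers:  {norm_orth:.4f}")
print(f"after {depth} contractive layers: {norm_contractive:.4f}")
```

In this toy setting the contractive stack shrinks the gradient norm by a factor of $0.9^{20} \approx 0.12$, while the orthogonal stack leaves it unchanged. A Lipschitz bound on each layer therefore caps the gradient norm from above but does not by itself keep it from decaying with depth.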