no code implementations • 4 Feb 2024 • Oria Gruber, Haim Avron
In this work, we focus on investigating the implicit bias originating from weight initialization.