1 Jan 2021 • Julio Hurtado, Alain Raymond, Alvaro Soto
As a working hypothesis, we speculate that during learning some weights focus on mining patterns from frequent examples while others are in charge of memorizing rare long-tail samples.