no code implementations • 2 Dec 2023 • Juno Kim, Jaehyuk Kwon, Mincheol Cho, Hyunjong Lee, Joong-Ho Won
In this paper, we explore the use of heavy-tailed models to combat over-regularization.
Decoder