Search Results for author: Gouki Minegishi

Found 2 papers, 2 papers with code

Interpreting Grokked Transformers in Complex Modular Arithmetic

1 code implementation · 26 Feb 2024 · Hiroki Furuta, Gouki Minegishi, Yusuke Iwasawa, Yutaka Matsuo

Grokking has been actively explored to reveal the mystery of delayed generalization.

Grokking Tickets: Lottery Tickets Accelerate Grokking

1 code implementation · 30 Oct 2023 · Gouki Minegishi, Yusuke Iwasawa, Yutaka Matsuo

We aim to analyze the mechanism of grokking from the lottery ticket hypothesis, identifying the process to find the lottery tickets (good sparse subnetworks) as the key to describing the transitional phase between memorization and generalization.

Image Classification, Memorization
