Ongoing Research
Investigating Grokking as a Phase Transition under the SETOL framework and Latent Dunning-Kruger Dynamics
(WIP) 1. Introduction Grokking is a delayed yet sudden leap in generalization performance, often observed in neural networks long after they have perfectly memorized the training set. Recent studies indicate two compelling perspectives: 1. Spectral Phase Transition: Grokking arises from heavy-tailed self-regularization (HTSR), where the network’s eigenvalue spectrum evolves