concept
active
concept:grokkingGrokking
Observed in IOI alignment map training where IIA stays low for many steps then quickly jumps
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Alignment Map (ϕ)associated_withThe bijective function mapping DNN inner neurons to latent variables in causal abstraction; its complexity is the central variable studied