concept
active
concept:grokking

Grokking

Observed in IOI alignment map training where IIA stays low for many steps then quickly jumps

Neighborhood — ranked by edge-count

Concepts (1)

concept
  • Alignment Map (ϕ)
    associated_with
    The bijective function mapping DNN inner neurons to latent variables in causal abstraction; its complexity is the central variable studied