framework
active
framework:computational-account-of-layer-dependent-introspection

Computational Account of Layer-Dependent Introspection

This paper's proposed mechanistic explanation integrating signal injection, attention routing, predictive integration, and residual recovery

Neighborhood — ranked by edge-count

Concepts (3)

concept
  • The mid-to-late layer computational process that converts routed perturbation signals into explicit predictions
  • The network's tendency to actively attenuate injected perturbations over subsequent layers, erasing the signal before output
  • Mechanism by which attention heads detect injected perturbations and route information about them to the final token position

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.