concept
active
concept:pointer-arithmetic-in-transformersPointer Arithmetic in Transformers
A mechanism in models with positional embeddings where attention heads can manipulate and copy positional information to implement algorithms like induction heads
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Rotary Positional Embedding (RoPE)associated_withA positional encoding mechanism that doesn't put positional information into the residual stream; changes which pointer arithmetic algorithms are available
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Prior finding from Grant et al. 2025 used to interpret low MAS IIA for GRU-Transformer hidden state comparisons.
- Spontaneously reported experience of multiple simultaneous processing streams, observed even in base models.
- Transformers almost surely maintain input-injectivity throughout training, not just at initialisationhypothesis0.708Conjecture supported by Nikolaou et al. 2025 for last-token hidden states
- Foundational mechanistic interpretability paper on transformer circuit analysis
- Prior Anthropic paper enabling circuit-level analysis of attention-only transformers; motivates current MLP decomposition
- Evidence that in-context learning is not mere pattern matching but genuine optimization, relevant to applying the thesis to inference
- Lindsey (2026) found that intention checking accuracy peaks around half the network depth.