method
active
method:kv-caching
KV caching
Caching of the attention keys and values computed for earlier tokens, so that they are not recomputed at each decoding step; also described as providing a mechanism for introspection on earlier computations.
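As a minimal sketch of the caching idea described above, the following toy single-head attention decoder stores each position's key and value once and reuses them at later steps. Everything here is an illustrative assumption rather than anything taken from the source: the names `KVCache`, `decode_with_cache`, and `decode_without_cache` are hypothetical, the weights are random, and a real transformer would have many layers and heads.

```python
import math
import random

def matvec(x, W):
    """Multiply row vector x (length d) by matrix W (d x d)."""
    return [sum(x[i] * W[i][j] for i in range(len(x))) for j in range(len(W[0]))]

def attend(q, keys, values):
    """Scaled dot-product attention of one query over all stored positions."""
    scale = math.sqrt(len(q))
    scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in keys]
    top = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[j] for w, v in zip(weights, values)) / total for j in range(dim)]

class KVCache:
    """Append-only store of per-position keys and values (hypothetical helper)."""
    def __init__(self):
        self.keys = []
        self.values = []

    def append(self, k, v):
        self.keys.append(k)
        self.values.append(v)

def decode_with_cache(X, Wq, Wk, Wv):
    """Each step computes one new key/value pair and reuses all earlier ones."""
    cache = KVCache()
    outputs = []
    for x in X:
        cache.append(matvec(x, Wk), matvec(x, Wv))
        outputs.append(attend(matvec(x, Wq), cache.keys, cache.values))
    return outputs, cache

def decode_without_cache(X, Wq, Wk, Wv):
    """Baseline: every step recomputes keys and values for the whole prefix."""
    outputs = []
    for t in range(len(X)):
        keys = [matvec(x, Wk) for x in X[: t + 1]]
        values = [matvec(x, Wv) for x in X[: t + 1]]
        outputs.append(attend(matvec(X[t], Wq), keys, values))
    return outputs

random.seed(0)
d, n = 4, 5
rand_matrix = lambda rows, cols: [[random.gauss(0, 1) for _ in range(cols)] for _ in range(rows)]
X = rand_matrix(n, d)  # stand-in token representations
Wq, Wk, Wv = rand_matrix(d, d), rand_matrix(d, d), rand_matrix(d, d)

cached_out, cache = decode_with_cache(X, Wq, Wk, Wv)
full_out = decode_without_cache(X, Wq, Wk, Wv)
```

Both decoders produce the same outputs; the cached version computes n key/value pairs in total instead of n(n+1)/2. The entries left in `cache.keys` and `cache.values` are the stored results of earlier positions that every later position reads through attention, which is one way to picture the introspection reading mentioned in the description.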
Neighborhood — ranked by edge-count
Papers (1)
paper
Concepts (1)
concept
- Introspection · implements · The ability of a model to observe its own past internal states or computations; claimed to be architecturally permitted by transformers.
Artifacts (1)
artifact
- Original thread by janus explaining transformer information highways and introspection capabilities, posted on X.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Janus's claim about KV caching as an introspection mechanism.
- The key-value cache from steered tokens is retained during the no-steering continuation, allowing the causal effect of steering to propagate.
- Proposed pathway flowing across positions at each layer; carries key, value, and attention-weighted information horizontally.
- A process by which vasomotion collapses ambivalent neural patterns into durable definite states, reducing complexity.
- The causal steering experiment persists KV state over steered tokens so that downstream effects can be observed without continued steering.
- Clamping activations along the Assistant Axis to remain above a minimum threshold (25th percentile), introduced as a stabilization method.
- Unsupervised probe by Burns et al. to predict latent truth representations; cited as related but limited in generalization.