KV State Persistence

The key-value (KV) cache entries written during steered tokens are retained during the unsteered continuation, allowing the causal effect of steering to propagate to later tokens.

Neighborhood — ranked by edge-count

method

5-Token Steering Pulse Experiment
uses
Applies a 5-token steering pulse to each emotion probe and measures the persistence of the causal effect via a contrast z-score over the 200 subsequent tokens.
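
The entry does not define the contrast z-score, so the following is only one plausible reading, offered as a sketch. It assumes the "contrast" is the per-position difference between a probe readout in the steered run and in a matched unsteered control run, and that this difference is standardized against a set of null contrasts (for example, from control probes). The function name contrast_z_scores, its parameters, and the numbers are all hypothetical.

```python
from statistics import mean, stdev

def contrast_z_scores(steered, control, null_contrasts):
    """Standardize the per-position contrast (steered - control).

    Assumption: the reference distribution is a list of null contrasts;
    the source does not say how the z-score is actually normalized.
    """
    mu = mean(null_contrasts)
    sigma = stdev(null_contrasts)  # sample standard deviation
    return [((s - c) - mu) / sigma for s, c in zip(steered, control)]

# Made-up numbers, used only to show the arithmetic.
z = contrast_z_scores(
    steered=[1.5, 1.0, 0.5],
    control=[0.0, 0.0, 0.0],
    null_contrasts=[-1.0, 0.0, 1.0],
)
```

Under this reading, persistence would show up as z-scores that stay away from zero for many positions after the pulse ends.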

concept

KV state persistence across steered tokens
related_to
The causal steering experiment retains the KV state written during steered tokens, so downstream effects can be observed without continued steering.
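
A minimal toy sketch of why retaining the cache matters. It uses a single attention head with identity key/value projections, a 4-dimensional hidden state, and random inputs, all of which are simplifying assumptions rather than the setup of the actual experiment. The names run, attend, steer, and shift are hypothetical.

```python
import math
import random

D = 4  # toy hidden size (assumption)

def attend(query, keys, values):
    """Softmax attention of one query over every cached key/value pair."""
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(D) for key in keys]
    peak = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - peak) for s in scores]
    total = sum(weights)
    return [sum(w / total * v[i] for w, v in zip(weights, values)) for i in range(D)]

def run(inputs, steer_vec, pulse_len):
    """Process tokens one at a time, appending each key/value to a cache.

    steer_vec is added to the hidden state only for the first pulse_len
    tokens; the cache is never cleared afterwards.
    """
    keys, values, outputs = [], [], []
    for t, x in enumerate(inputs):
        h = [xi + si for xi, si in zip(x, steer_vec)] if t < pulse_len else list(x)
        keys.append(h)    # toy choice: identity key projection
        values.append(h)  # toy choice: identity value projection
        outputs.append(attend(h, keys, values))
    return outputs

rng = random.Random(0)
tokens = [[rng.gauss(0, 1) for _ in range(D)] for _ in range(25)]
steer = [2.0, 0.0, 0.0, 0.0]

steered = run(tokens, steer, pulse_len=5)   # 5-token pulse, then no steering
baseline = run(tokens, steer, pulse_len=0)  # never steered

# Difference along the steering direction at each position after the pulse.
shift = [s[0] - b[0] for s, b in zip(steered[5:], baseline[5:])]
```

After position 5 the two runs see identical inputs, so any nonzero entry in shift comes only from the steered keys and values still sitting in the cache. If the cache were rebuilt from unsteered hidden states, the continuation would match the baseline run exactly.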

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Emotional State Persistence · concept · 0.794
The property of emotion features maintaining elevated activation well beyond the local token context that triggered them.
KV caching overcomes statelessness and provides a mechanism for introspection of computations at earlier token positions. · claim · 0.744
Janus's claim about KV caching as an introspection mechanism.
KV caching · method · 0.724
Caching of key-value pairs to avoid recomputation; also provides a mechanism for introspection of earlier computations.
emotion feature persistence · concept · 0.723
The phenomenon that emotion feature activations remain elevated above baseline beyond local token bursts, measurable as long-range correlation.
residual persistence · concept · 0.723
Emotion feature persistence above and beyond the persistence expected from high variance explained alone, computed by subtracting median variance-matched probe persistence.
The persistence paradox · question · 0.717
Core logical puzzle: if an agent does not change, it dies; if it changes, the self ceases to exist. Applies to all scales from organelles to evolutionary lineages.
autoregressive persistence · concept · 0.712
Baseline persistence of any probe direction arising from the autoregressive nature of LLMs, not specific to emotion content.
Persistence Paradox · concept · 0.705
Core logical paradox: if a species fails to change it dies; if it changes, it ceases to exist. Same applies to individuals.