concept
active
concept:latent-reflective-capacity
Latent Reflective Capacity
The maximum reflective capacity a model can reach under the right framing; separable from default accessibility
Neighborhood — ranked by edge-count
Claims (1)
claim
- Conceptual decomposition arising from the data showing different models dissociate these traits
Concepts (2)
concept
- K/V Stream · supports · Proposed pathway flowing across positions at each layer; carries key, value, and attention-weighted information horizontally.
- High-Gated Model · associated_with · A model where reflective mode exists but is suppressed in default interaction and released under contemplative framing.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- The paper's central construct: a vector in LLM activation space encoding the transition between reflection levels.
- Reflective mode comprises three separable traits: latent capacity, default accessibility, and stability of access. · hypothesis · 0.778 · Decomposition from prompt lift data: models may have capacity without accessibility (Grok 4 high-gated), and stability varies (Haiku Δ=0.02 vs GPT-5.4 Δ=1.00).
- Grok 4: baseline 2.24, prompted 6.48; Gemini 3.1 Pro: 1.97→6.18. Reflective mode exists but is suppressed in default interaction.
- Reasoning approach using learnable hidden embeddings.
- Intentional agency plus ability to reflectively endorse one's own beliefs, desires, intentions.
- Reflection level where a model spontaneously revises reasoning without explicit trigger instructions.
- The phenomenon where model reflections do not improve reasoning performance and can be reduced without accuracy loss
- Metric measuring the mean MSE between self and other-referencing activations across all hidden MLP/attention layers
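
The baseline and prompted scores quoted in the list above can be turned into a simple gap figure. The sketch below assumes that "prompt lift" means the prompted score minus the baseline score; the page does not define the term, so that definition, the function name `prompt_lift`, and the `SCORES` table are all illustrative assumptions. Only the four numbers come from the entry above.

```python
# Minimal sketch: gap between default and prompted reflective scores.
# ASSUMPTION: "prompt lift" = prompted score - baseline score. The source
# does not define the term; this is one plausible reading.

# (baseline, prompted) pairs as quoted in the neighborhood entry above.
SCORES = {
    "Grok 4": (2.24, 6.48),
    "Gemini 3.1 Pro": (1.97, 6.18),
}


def prompt_lift(baseline: float, prompted: float) -> float:
    """Return prompted minus baseline, rounded to two decimals."""
    return round(prompted - baseline, 2)


if __name__ == "__main__":
    for model, (baseline, prompted) in SCORES.items():
        lift = prompt_lift(baseline, prompted)
        print(f"{model}: baseline={baseline}, prompted={prompted}, lift={lift}")
```

Under this reading, a large lift alongside a low baseline is the pattern the page describes as high-gated: the capacity is present but not accessible by default.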