concept
active
concept:causal-informational-coupling
Causal informational coupling
Operational definition of introspection with two conditions: (1) self-report covaries monotonically with a probe-defined direction, and (2) causally shifting the activations shifts the report in a semantically coherent way
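The causal half of this definition can be sketched as a dose-response check. This is a minimal illustration, not the paper's procedure. It assumes steering adds a multiple alpha of the probe direction to the activations, that a mean numeric self-report is recorded at each alpha, and that "semantically coherent" is read as "the report moves in the same direction as the steering". The function name, the min_shift parameter, and the example numbers are hypothetical.

```python
def steering_is_coherent(alphas, mean_reports, min_shift=0.0):
    """Return True if the mean self-report rises with steering strength.

    alphas: steering coefficients applied along the probe direction
            (hypothetical experiment settings).
    mean_reports: mean numeric self-report observed at each coefficient.
    min_shift: smallest net change counted as a real shift (an assumed
               threshold, not a value taken from the source).
    """
    # Order the observations by steering coefficient.
    pairs = sorted(zip(alphas, mean_reports))
    reports = [r for _, r in pairs]

    # The report must never drop as the coefficient grows ...
    non_decreasing = all(b >= a for a, b in zip(reports, reports[1:]))
    # ... and must move by more than min_shift over the whole sweep.
    net_shift = reports[-1] - reports[0]
    return non_decreasing and net_shift > min_shift


# Made-up sweep: negative alphas steer against the probe direction.
alphas = [-2, -1, 0, 1, 2]
reports = [2.1, 2.8, 3.5, 4.4, 5.0]
print(steering_is_coherent(alphas, reports))
```

A flat response (the report ignores steering) or a reversed one (the report moves against the steering) both fail this check, which is what separates causal coupling from mere correlation between probe score and report.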
Neighborhood — ranked by edge-count
Thinkers (1)
thinker
- Iulia-Maria Comsa · introduces · Argued that genuine introspection requires a causal connection between internal state and report; provided the theoretical framing adopted by this paper
Frameworks (1)
framework
- Quantitative Introspection Framework · implements · The paper's central contribution: treating LLM numeric self-report as a quantitative signal validated against probe-defined internal states, with causal confirmation via steering
Concepts (2)
concept
- Introspective fidelity · implements · Isotonic R², the fraction of variance in self-report explained by the probe score under a monotonicity assumption; the paper's primary fidelity metric
- Introspective strength · implements · Spearman ρ, the rank-order agreement between logit-based self-report and probe score; the paper's primary monotonic association metric
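Both metrics can be computed from paired probe scores and numeric self-reports. The following is a standard-library sketch, not the paper's code. It assumes one probe score and one already-extracted numeric report per sample (how the report is derived from logits is not shown) and that the monotone relation is non-decreasing. The isotonic fit uses the pool-adjacent-violators algorithm, and all function names and sample values are hypothetical.

```python
from statistics import mean


def average_ranks(values):
    """Rank values 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1..j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / (vx * vy) ** 0.5


def spearman_rho(x, y):
    """Spearman rho: Pearson correlation of the rank-transformed data."""
    return pearson(average_ranks(x), average_ranks(y))


def isotonic_fit(x, y):
    """Non-decreasing least-squares fit of y on x (pool adjacent violators)."""
    order = sorted(range(len(x)), key=lambda i: x[i])
    blocks = []  # each block holds [sum of y, count]
    for i in order:
        blocks.append([y[i], 1])
        # Merge backwards while the previous block mean exceeds the last one.
        while (len(blocks) > 1 and
               blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]):
            s, c = blocks.pop()
            blocks[-1][0] += s
            blocks[-1][1] += c
    fitted = [0.0] * len(x)
    pos = 0
    for s, c in blocks:
        for _ in range(c):
            fitted[order[pos]] = s / c
            pos += 1
    return fitted


def isotonic_r2(x, y):
    """Fraction of variance in y explained by the isotonic fit on x."""
    fitted = isotonic_fit(x, y)
    my = mean(y)
    ss_res = sum((b - f) ** 2 for b, f in zip(y, fitted))
    ss_tot = sum((b - my) ** 2 for b in y)
    return 1.0 - ss_res / ss_tot


# Made-up sample: probe scores and numeric self-reports for five prompts.
probe = [0.1, 0.4, 0.5, 0.9, 1.3]
report = [1.0, 2.0, 1.5, 3.0, 4.0]
print(spearman_rho(probe, report), isotonic_r2(probe, report))
```

Spearman ρ only uses rank order, so it is insensitive to how far apart the reports are, while isotonic R² measures how much of the report's variance a monotone function of the probe score can account for. For a decreasing relation, negate the probe scores before calling isotonic_r2.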
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Maturana's concept of continuous dynamic interaction between an organism and its environment; key to understanding living cognition.
- Function determining the value of a variable based on its causal parents in an acyclic causal model.
- Property that causal mechanisms remain stable across environments; desirable for out-of-distribution (OOD) generalization.
- Whether an internal direction causally controls a target behavior, verified by intervention success
- Emergent causation where macro-variable has causal influence on its own future independently of micro-states.
- A framework the paper uses alongside feature geometry to deepen mechanistic understanding of LMs
- Hippocampal oscillatory phenomenon reproduced by active inference; phase-amplitude coupling.