concept
active
concept:causal-informational-coupling

Causal informational coupling

Operational definition of introspection: self-report covaries monotonically with probe-defined direction AND causally shifting activations shifts the report in a semantically coherent way

Neighborhood — ranked by edge-count

Thinkers (1)

thinker
  • Argued genuine introspection requires causal connection between internal state and report; provided theoretical framing adopted by this paper

Frameworks (1)

framework
  • The paper's central contribution: treating LLM numeric self-report as a quantitative signal validated against probe-defined internal states with causal confirmation via steering

Concepts (2)

concept
  • Isotonic R² measuring fraction of variance in self-report explained by probe score under monotonicity assumption; the paper's primary fidelity metric
  • Spearman ρ measuring rank-order agreement between logit-based self-report and probe score; the paper's primary monotonic association metric

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

  • Maturana's concept of continuous dynamic interaction between an organism and its environment; key to understanding living cognition.
  • Causal Mechanismconcept0.765
    Function determining the value of a variable based on its causal parents in an acyclic causal model.
  • Causal Invarianceconcept0.765
    Property that causal mechanisms remain stable across environments; desirable for OOD.
  • Causal Mediationconcept0.760
    Whether an internal direction causally controls a target behavior, verified by intervention success
  • Causal Decouplingconcept0.759
    Emergent causation where macro-variable has causal influence on its own future independently of micro-states.
  • Causal abstractionconcept0.756
    A framework the paper uses alongside feature geometry to deepen mechanistic understanding of LMs
  • Hippocampal oscillatory phenomenon reproduced by active inference; phase-amplitude coupling.