framework
active
framework:introspective-exploration-componentIntrospective Exploration Component
The novel framework introduced in the paper: an HMM-based pain-belief signal integrated into the reward function to drive exploration
Neighborhood — ranked by edge-count
Papers (1)
paper
Methods (1)
method
- Hidden Markov ModelimplementsCore computational method used to infer pain-belief from online observations of happiness
Concepts (3)
concept
- Self Awarenessimplements
- Pain-BeliefintroducesThe latent state inferred by the agent representing its belief about being in pain, used as exploration signal
- Biological Pain as Learning SignalimplementsThe biological inspiration for the paper's introspective signal; pain encodes internal evaluations guiding agents through environments
Artifacts (1)
artifact
- Public code repository for the paper's experiments
Datasets (1)
dataset
- Public dataset associated with the paper's experiments
Questions (1)
question
- Design question answered in the paper by choosing latent inference over direct feedback
Frameworks (1)
framework
- Bayesian Model of PainextendsConceptualization of pain perception as inference over hidden nociceptive causes, from Eckert et al. 2022
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The ability of a model to observe its own past internal states or computations; claimed to be architecturally permitted by transformers.
- The capacity to detect and report one's own internal states, measured via the five-adjective task and paradox reflection
- Isotonic R² measuring fraction of variance in self-report explained by probe score under monotonicity assumption; the paper's primary fidelity metric
- The central concept: the ability of a model to access and report on its internal states, as defined by the paper's criteria.
- Spearman ρ measuring rank-order agreement between logit-based self-report and probe score; the paper's primary monotonic association metric
- Conceptual distinction between (i) information internally available about a state and (ii) capacity to transform that signal into precise output reports
- Key gap identified in the literature; systematic self-examination processes for machine consciousness development.
- Pearson-Vogel et al.'s finding that models can detect prior concept injections; introspective signals exist in middle layers suppressed by post-training