concept
active
concept:interpretive-validationInterpretive Validation
CIMC's methodology for evaluating whether a built system is conscious: combining multiple forms of evidence including predicted functional organization and developmental trajectories
Neighborhood — ranked by edge-count
Papers (1)
paper
- cimcWhitepaperintroduces
Methods (1)
method
- CIMC's proposed evaluation methodology: examining what systems build within themselves and inferring to best explanation
Questions (1)
question
- Methodological question driving CIMC's development of interpretive validation over behavioral testing
Hypotheses (1)
hypothesis
- General computational machines with sufficient resources possess the necessary and sufficient means to implement consciousnessassociated_withCIMC's central testable hypothesis grounding the entire research program
Concepts (1)
concept
- Internal structure of AI systems that CIMC proposes to analyze interpretively to evaluate consciousness hypotheses
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The capability to explain model predictions; a central theme of the paper, with disruption profiles as vehicle.
- The historical/hermeneutic approach adopted by the paper to analyze cybernetic diagrams in light of Flusser’s philosophy.
- Method using large language models (Claude) to generate and test explanations of features at scale
- Proposed paradigm for evaluating interpretability work through empirical falsifiability rather than benchmarks or user studies
- Programming technique to restructure a fine-grained Linda program for efficiency by replacing live data structures with passive ones and coarser-grain processes.
- Cases where subspace interventions change model behaviour through parallel pathways rather than the target feature
- Ian Goodfellow quote used to illustrate the pre-paradigmatic state of interpretability research
- Framework of using internal-state representations to control or steer generative models; conceptually parallel to manifold steering in language models.