Representational Embedding Spaces

Internal structure of AI systems that CIMC proposes to analyze interpretively to evaluate consciousness hypotheses

Neighborhood — ranked by edge-count

method

Dictionary Learning for Neural Network Interpretability
supports
Bricken et al.'s method for decomposing language models into interpretable features; cited as AI-alignment interpretability work relevant to consciousness detection

concept

Interpretive Validation
supports
CIMC's methodology for evaluating whether a built system is conscious; it combines multiple forms of evidence, including predicted functional organization and developmental trajectories

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Vector Embedding Representation · concept · 0.831
The specific type of representation studied in the paper: a function f: X → R^n that assigns a feature vector to each input
Latent-Space Representations · concept · 0.764
Substrate on which causal emergence was computed across agent lifetimes; aligned with learning success.
Representational Transparency · concept · 0.747
Property of conscious representations: they do not contain information about the fact that they are representations at the level of the representation itself
Representational Disentanglement · concept · 0.746
CIMC's characterization of part of the solution to the Hard Problem: insight into the structural necessities of phenomenal representation
Representational dynamics · concept · 0.746
The evolution of an agent's latent representations over the course of training, shown to align with reward improvement when causal emergence is high.
Structure in representations · concept · 0.746
The central question of whether representational geometry implies corresponding computational structure
Orthogonal Decomposition of Representation Space · concept · 0.745
Mathematical structure central to distributed interchange interventions; representation space decomposed into orthogonal subspaces each aligned with a high-level variable.
representation manifold · concept · 0.745
One-dimensional curved manifold (a curve) in internal activation space; the paper demonstrates its alignment with the behavior manifold.
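
The "cosine ≥ 0.65 · no typed edge" listing above can be read as a simple filter: score every other entity by cosine similarity to this one, drop those already linked by a typed edge, and keep what clears the threshold. The sketch below illustrates that reading only. The function names, the two-dimensional toy vectors, and the choice to treat typed edges as undirected are all assumptions made for illustration; they are not the actual pipeline or the real embeddings behind the scores shown.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def untyped_neighbors(target, embeddings, typed_edges, threshold=0.65):
    """Return (name, score) pairs with cosine >= threshold and no typed edge to target.

    typed_edges is a list of (source, destination) name pairs; direction is
    ignored here, which is an assumption of this sketch.
    """
    linked = {b if a == target else a for a, b in typed_edges if target in (a, b)}
    results = []
    for name, vec in embeddings.items():
        if name == target or name in linked:
            continue
        score = cosine(embeddings[target], vec)
        if score >= threshold:
            results.append((name, round(score, 3)))
    # Highest similarity first, matching the ordering of the listing.
    return sorted(results, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy vectors, chosen only so the arithmetic is easy to follow.
embeddings = {
    "Representational Embedding Spaces": [1.0, 0.0],
    "Vector Embedding Representation": [0.8, 0.6],  # similar, no typed edge
    "Interpretive Validation": [0.9, 0.1],          # similar, but already linked
    "Unrelated Entity": [0.0, 1.0],                 # orthogonal, below threshold
}
typed_edges = [("Interpretive Validation", "Representational Embedding Spaces")]

candidates = untyped_neighbors(
    "Representational Embedding Spaces", embeddings, typed_edges
)
print(candidates)
```

Entities returned by a filter like this are the ones worth reviewing by hand, either to add a typed edge or to merge as duplicates.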