Representational familiarity

How familiar a model is with a numeral system, manipulated via bases in Experiment 2.

Neighborhood — ranked by edge-count

method

E2: Numeral-Base Arithmetic Controlled Study
studies
Quantitative study varying representational familiarity via numeral bases B10/B8/B9 at fixed computational complexity

concept

Base-10 arithmetic
associated_with
High pretraining exposure numeral system used in E2.
Base-8 arithmetic
associated_with
Moderate pretraining exposure numeral system used in E2.
Base-9 arithmetic
associated_with
Low pretraining exposure numeral system used in E2.

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Representational Transparencyconcept0.810
Property of conscious representations: they do not contain information about the fact that they are representations at the level of the representation itself
Representational Convergenceconcept0.785
The central empirical phenomenon: different neural networks trained on different data/objectives develop increasingly similar representations
Representational Alignmentconcept0.784
Measure of similarity between the similarity structures (kernels) induced by two different representations
Representational Divergenceconcept0.782
Core phenomenon studied: when causal interventions shift internal representations away from the natural distribution
Representational Honestyconcept0.780
The proposed domain-general property indexed by deception features that governs both factual accuracy and experiential self-report
representational mismatch drconcept0.777
Distance between prior and target representations.
Representational dynamicsconcept0.774
The evolution of an agent's latent representations over the course of training, shown to align with reward improvement when causal emergence is high.
representation manifoldconcept0.763
One-dimensional curved surface in internal activation space; the paper demonstrates alignment with behavior manifold.