Representational Convergence

The central empirical phenomenon: different neural networks, trained on different data and objectives, develop increasingly similar representations.

Neighborhood — ranked by edge-count

Representational Alignment
associated_with
Measure of similarity between the similarity structures (kernels) induced by two different representations
Anna Karenina Scenario
extends
Hypothesis that all well-performing neural nets represent the world in the same way; the Platonic Representation Hypothesis (PRH) extends this by specifying what representation they converge to
Simplicity Bias
supports
The tendency of deep networks to implicitly favor simpler solutions that fit the data, driving convergence
Multitask Scaling
supports
The pressure on models trained on more tasks to find representations that generalize across all tasks, reducing the solution space
Sociological Bias in AI Development
supports
Researcher preferences and goals of mimicking human reasoning shape model development, potentially causing convergence toward human-like representations
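
The Representational Alignment entry above describes comparing the kernels induced by two representations. A minimal sketch of one such comparison follows, assuming linear kernels and centered kernel alignment (linear CKA) as the metric. The source does not say which alignment metric is used, so this choice and the function names (`gram`, `center`, `kernel_alignment`) are illustrative assumptions, not the method of any particular paper.

```python
import math

def gram(X):
    """Linear kernel: K[i][j] is the inner product of rows i and j of X."""
    return [[sum(a * b for a, b in zip(xi, xj)) for xj in X] for xi in X]

def center(K):
    """Double-center a kernel: subtract row and column means, add the grand mean."""
    n = len(K)
    row = [sum(r) / n for r in K]
    col = [sum(K[i][j] for i in range(n)) / n for j in range(n)]
    total = sum(row) / n
    return [[K[i][j] - row[i] - col[j] + total for j in range(n)] for i in range(n)]

def frob_inner(A, B):
    """Frobenius inner product of two equally sized matrices."""
    return sum(a * b for ra, rb in zip(A, B) for a, b in zip(ra, rb))

def kernel_alignment(X, Y):
    """Linear CKA between two representations of the same n inputs.

    X and Y are lists of feature vectors (one per input); their feature
    dimensions may differ. Assumes neither representation is constant
    across inputs, otherwise the denominator is zero.
    """
    Kx, Ky = center(gram(X)), center(gram(Y))
    return frob_inner(Kx, Ky) / math.sqrt(frob_inner(Kx, Kx) * frob_inner(Ky, Ky))

# Hypothetical toy data: Y is X rotated by 90 degrees and scaled by 3,
# so both induce the same similarity structure up to a constant factor.
X = [[1, 0], [0, 1], [1, 1], [2, 0]]
Y = [[0, 3], [-3, 0], [-3, 3], [0, 6]]
print(kernel_alignment(X, Y))  # close to 1.0, up to floating-point error
```

Because the score depends only on the centered kernels, it ignores rotations and uniform rescaling of the feature space, which is what makes it usable across networks whose feature dimensions have no shared meaning.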

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

What exactly is the endpoint of representational convergence? · question · 0.857
Motivates Section 4 where the PMI-kernel formalization is proposed
What has led to representational convergence, will it continue, and ultimately where does it end? · question · 0.853
Central motivating questions of the paper
Representational Divergence · concept · 0.844
Core phenomenon studied: when causal interventions shift internal representations away from the natural distribution
Representational Failure · concept · 0.798
A failure mode exposed by the SAE framework where model representations are entangled or collapse under intervention
Representational dynamics · concept · 0.786
The evolution of an agent's latent representations over the course of training, shown to align with reward improvement when causal emergence is high.
Representational familiarity · concept · 0.785
How familiar a model is with a numeral system, manipulated via bases in Experiment 2.
representational drift · concept · 0.778
Accumulation of mismatch in later layers causing S degradation.
Representational Transparency · concept · 0.769
Property of conscious representations: they do not contain information about the fact that they are representations at the level of the representation itself