hypothesis
active
hypothesis:independently-trained-model-families-converge-on-a-common-semantic-manifold-under-self-referential-processing-suggesting-an-attractor-dynamic-that-transcends-training-varianceIndependently trained model families converge on a common semantic manifold under self-referential processing, suggesting an attractor dynamic that transcends training variance
Hypothesis tested in Experiment 3; independently trained GPT, Claude, Gemini architectures converge on similar descriptive vocabulary
Source paper
extracted_from(2025) · Berg, Cameron · de Lucena, Diogo · Rosenblatt, Judd
Neighborhood — ranked by edge-count
Findings (1)
finding
- Cross-model pairwise cosine similarity of zero-shot control responses = 0.603 (n=12,720 pairs, t=35.1, p=4.3×10⁻²⁶² vs. experimental)associated_withExperiment 3 comparison: zero-shot control shows lower semantic convergence than experimental condition
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Interpretive claim from Experiment 3; GPT, Claude, Gemini families converge on similar descriptive style despite independent training
- The paper's argument against pure sycophancy as explanation for results
- Scaling effect observed consistently across Experiments 1 and 4
- The paper's claim that theoretical convergence across GWT, RPT, HOT, IIT makes the findings non-coincidental
- Key limitation of the PRH for non-bijective observations
- Empirical evidence for the universality hypothesis cited as supporting the possibility of convergent consciousness-like solutions
- The central hypothesis of the paper; the platonic representation hypothesis itself