hypothesis
active
hypothesis:scaling-model-size-as-well-as-data-and-task-diversity-drives-representational-convergence-toward-the-platonic-representationScaling model size, as well as data and task diversity, drives representational convergence toward the platonic representation
Core mechanism hypothesis connecting PRH to the empirical trend of scaling in AI
Source paper
extracted_from(2024) · Minyoung Huh · Brian Cheung · Tongzhou Wang · Phillip Isola
Neighborhood — ranked by edge-count
Findings (1)
finding
- Core cross-modal empirical result: larger and better language models align better with vision models
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Bigger models are more likely to converge to a shared representation than smaller modelshypothesis0.834Selective pressure toward convergence via model capacity
- Key limitation of the PRH for non-bijective observations
- Implication of PRH: larger models should amplify bias less and hallucinate less if they better model reality
- Interpretation of weaker PCA separation and lower ASR in smaller models
- Implication of PRH for training practice: both modalities point at the same underlying reality
- Interpretive claim connecting scale to abstraction level in LLM representations
- How do representations differ or converge between architectures, tasks, and modalities?question0.786Broader research question MAS is positioned to address, citing multiple recent works.