hypothesis

active

hypothesis:there-are-fewer-representations-competent-for-n-tasks-than-m-n-tasks-so-training-more-general-models-should-yield-fewer-possible-solutions

There are fewer representations competent for N tasks than for M < N tasks, so training more general models should yield fewer possible solutions

Selective pressure toward convergence via task generality
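
The counting argument behind this hypothesis can be illustrated with a toy enumeration. The sketch below is an assumption-laden illustration, not the source paper's method: the 4-bit "representations", the three predicate "tasks", and the helper name `competent` are all hypothetical. It shows only that the set of candidates competent for the first N tasks is a subset of the set competent for the first M < N tasks. Note that set inclusion alone guarantees the count never increases; it strictly shrinks only when an added task rules out at least one remaining candidate.

```python
from itertools import product

# Hypothetical toy setup: a "representation" is a 4-bit tuple, and a "task"
# is a predicate saying whether a representation is competent for it.
candidates = list(product([0, 1], repeat=4))

tasks = [
    lambda r: r[0] == 1,      # task 1: bit 0 must be set
    lambda r: r[1] == r[2],   # task 2: bits 1 and 2 must agree
    lambda r: sum(r) >= 2,    # task 3: at least two bits must be set
]

def competent(reps, task_subset):
    """Return the representations that satisfy every task in task_subset."""
    return [r for r in reps if all(task(r) for task in task_subset)]

# Number of competent representations as tasks are added one at a time.
counts = [len(competent(candidates, tasks[:n])) for n in range(len(tasks) + 1)]
print(counts)  # [16, 8, 4, 3]
```

Each added task acts as one more constraint on the same candidate pool, which is why the counts are non-increasing. Real representation spaces are continuous and tasks are satisfied only approximately, so this captures the direction of the argument rather than its magnitude.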

Source paper

extracted_from

The Platonic Representation Hypothesis

(2024) · Minyoung Huh · Brian Cheung · Tongzhou Wang · Phillip Isola

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Bigger models are more likely to converge to a shared representation than smaller models · hypothesis · 0.819
Selective pressure toward convergence via model capacity
Earlier/less capable models exhibit a larger gap between think and don't think representation strength · finding · 0.815
Claude 3 models show a bigger difference than newer models like Opus 4.1.
Software implementations for all of the models/behaviours presented are common for n = 2, and can be made very efficient for α_i that map many objects onto a much smaller set of object families. · claim · 0.805
Claim about current practical feasibility and efficiency of 2-way associative implementations.
For a given task, the number of all sequences which work is tiny by comparison with the huge number of all possible sequences; less than a trillionth of all 6 × 10^23 possible sequences actually work well enough. · claim · 0.801
A combinatorial argument that good sequences are astronomically rare, emphasizing the difficulty of discovery.
Fewer concepts must be learned if all tools share a unified underlying model · claim · 0.797
Kay argues that presenting draw, spreadsheet, and text as instances of the same rectangle/rule abstraction reduces cognitive load versus separate systems.
MAS reveals that numeric representations differ between GRUs trained on Multi-Object, Rounding, and Modulo tasks · finding · 0.793
Case study showing MAS can compare specific causal information types across models trained on different tasks.
How do representations differ or converge between architectures, tasks, and modalities? · question · 0.793
Broader research question MAS is positioned to address, citing multiple recent works.
Models more effective at recognizing abstract nouns than other concept types · finding · 0.792
Opus 4.1 demonstrates highest introspective awareness on abstract nouns (justice, peace, betrayal) with nonzero awareness across all concept categories tested.