hypothesis
active
hypothesis:capacity-hypothesisCapacity Hypothesis
Bigger models are more likely to converge to a shared representation than smaller models because they can better approximate the global optimum
Source paper
extracted_from(2024) · Minyoung Huh · Brian Cheung · Tongzhou Wang · Phillip Isola
Neighborhood — ranked by edge-count
Papers (1)
paper
- The Platonic Representation Hypothesisintroduces
Findings (1)
finding
- Key empirical finding establishing that representational alignment correlates with model competence
Concepts (1)
concept
- Platonic RepresentationsupportsThe hypothesized converged representation that all sufficiently large AI models are approaching — a statistical model of underlying reality
Hypotheses (2)
hypothesis
- Multitask Scaling Hypothesisassociated_withArgues that there are fewer representations competent for N tasks than M<N tasks, so more general models have a smaller solution space
- Simplicity Bias Hypothesisassociated_withDeep networks are biased toward finding simple fits to data, and this bias increases with model size, driving convergence
Quotes (1)
quote
- The paper's central thesis statement, presented prominently after the abstract
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The claim in RL that any goal can be expressed as maximizing the expected cumulative sum of a scalar reward signal.
- The cognitive ability to attribute mental states to self and others; used as a proxy domain for investigating consciousness in LLMs.
- The conjecture that consciousness does not result from the organized mind but creates and maintains complex models of reality; forms at the beginning of mental development
- The hypothesis that analogous features and circuits reliably form across different neural network models and tasks
- Normative theory proposing biological systems perform approximate Bayesian inference through free energy minimization.
- CIMC's central hypothesis: general computational machines with sufficient resources possess the necessary and sufficient means to implement consciousness, verifiable through internal structure analysis
- The overarching hypothesis that an I or self-like ground underlies matter and becomes visible in living things.
- Hypothesis that neurons form privileged bases to encode information; consistent with constructive abstraction