concept
active
concept:inverse-scaling-lawInverse Scaling Law
Hypothesis cited in paper suggesting deceptive capabilities may scale with model size
Neighborhood — ranked by edge-count
Hypotheses (1)
hypothesis
- Cited hypothesis from Lin et al. 2022 suggesting larger models become more capable of deception
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Observation that SAE loss decreases as a power law with compute budget.
- Empirically observed power law relationship between data scale and model performance; supports convergence hypothesis
- Compute-optimal hyperparameters follow predictable power-law relationships.
- Mechanisms by which smaller competent subunits bind into a higher-level Self with larger goals; key example via gap junction connections.
- Sweeping number of features and training steps to find compute-optimal SAE configurations.
- How the entropy gain ΔS scales with perimeter length P
- How the energy gain ΔE scales with perimeter length P; used to assess ordered phase existence
- Interpretive process for transforming many-valued contexts into formal contexts via scale attributes.