hypothesis
active
hypothesis:if-loss-keeps-going-down-on-the-test-set-in-the-limit-the-model-must-be-learning-to-interpret-and-predict-all-patterns-represented-in-language-including-common-sense-reasoning-goal-directed-optimization-and-deployment-of-the-sum-of-recorded-human-knowledgeIf loss keeps going down on the test set, in the limit the model must be learning to interpret and predict all patterns represented in language, including common-sense reasoning, goal-directed optimization, and deployment of the sum of recorded human knowledge.
Extrapolation of scaling predictive models to AGI.
Source paper
extracted_fromNeighborhood — ranked by edge-count
Concepts (1)
concept
- GPT (Generative Pre-trained Transformer)associated_withA family of large language models trained on next-token prediction, central example of simulators.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- §3 Discussion.
- The benchmark’s diagnostic value lies in identifying why a model loses, not just that it losesclaim0.787argues for fine-grained behavioral analysis over aggregate rankings
- Acknowledges the confound of not explicitly instructing models to track wealth, yet points to reasoning gaps given code agents avoid errors without prompts.
- Describes scaffolding method and the model's meta-learning loop.
- Key limitation of the PRH for non-bijective observations
- Based on informal audience experiments; implies people use prior knowledge about rule structure
- Core definitional quote for performative chain-of-thought
- Prediction orthogonality thesis.