concept
active
concept:internet-scale-training-corpus
Internet-scale Training Corpus
The large corpus of human-generated text on which LLMs are trained, which supplies character archetypes and narrative structures.
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Large Language Models (LLMs) · associated_with · Transformer-based models like GPT-4, LaMDA, PaLM; assessed for GWT indicators.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- Scaling laws for dictionary learning are unknown and needed to assess feasibility on frontier models
- AI training method inspired by behaviorism, used for autonomous cars and drones; cited as bioinspired success
- language models recapitulate cyclic structure of human concepts from pretraining data · hypothesis · 0.713 · Explanation for why manifold geometry emerges: implicit structure in training data (co-occurrence patterns) shapes internal representations.
- Interpretive process for transforming many-valued contexts into formal contexts via scale attributes.
- A formal context with a suggestive interpretation used in conceptual scaling.
- RLHF paper cited as a major fine-tuning technique used in commercial dialogue agents
- The quality of spatial dimensions that feel comfortable and nurturing to human beings.