framework
active
framework:instrumental-convergenceInstrumental convergence
The thesis that sufficiently advanced agents will converge to similar subgoals.
Neighborhood — ranked by edge-count
Claims (1)
claim
- Argues against instrumental convergence in GPT.
Artifacts (1)
artifact
- Simulators (LessWrong post)mentionsThe paper being extracted.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The central empirical phenomenon: different neural networks trained on different data/objectives develop increasingly similar representations
- The process by which individuals, through refining their ability to perceive living structure, gradually come to agree on aesthetic and life judgments.
- Operational criterion for strategic deception: CoT steps demonstrate causal link between deception and goal achievement
- Anton's synthesis, referenced as connected to this work
- What has led to representational convergence, will it continue, and ultimately where does it end?question0.705Central motivating questions of the paper
- Key property enabling organized collective action; mechanism by which parts coordinate into functional wholes.
- Core research questions motivating the paper
- Philosophy of science position that science converges on truth; cited as precursor to the platonic representation hypothesis