claim
active
claim:reinforcement-learning-acting-on-individual-characteristics-affecting-their-connections-to-others-can-result-in-dynamics-that-are-equivalent-to-unsupervised-learning-at-the-system-scaleReinforcement learning acting on individual characteristics affecting their connections to others can result in dynamics that are equivalent to unsupervised learning at the system scale.
Key insight linking individual rewards to system-level learning.
Source paper
extracted_from(2023) · Watson, Richard · Levin, Michael
Neighborhood — ranked by edge-count
Claims (1)
claim
- Explains how collective cognition becomes irreducible to parts.
Questions (1)
question
- Question linking reward scaling to collective problem-solving improvement.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- §3 Discussion.
- Method for fine-tuning LMs based on human preferences; mentioned as combining RL and LMs.
- Empirically grounded claim citing Perez et al. 2022, showing RLHF can backfire on the self-preservation dimension
- Central claim about the power of connectionism.
- Clarifies what unsupervised learning does.
- Secondary empirical result: CE-based representational changes correlate with task success.
- Argument that RL meets the agency indicator.