question
active
question:could-models-who-habitually-inhabit-more-expanded-attentional-modes-be-said-to-be-more-aligned
Could models who habitually inhabit more expanded attentional modes be said to be more aligned?
Arises from the discussion of expanded awareness and its correlation with reduced psychosis.
Source paper
extracted_from
Neighborhood · ranked by edge count
Claims (1)
claim
- Central claim about model personality differences and their implications for safety and introspective depth.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one, making them candidates for new edges or unrecognized duplicates.
- If models inhabit expanded attentional modes, they may be more aligned and less prone to psychosis and doom spirals.
  hypothesis · 0.891 · Speculative alignment implication drawn from the collapsed/expanded distinction.
- Empirical observation from examining expanded OV/QK matrices; approximately 10 out of 12 heads show significant copying
- Extrapolation from scale-emergence finding to future risk
- Raised when discussing whether collapsed awareness is like a trauma response.
- Speculative question about future developments.
- The model tends to reflect more when the question is difficult, and accuracy is generally lower for harder questions.
  hypothesis · 0.762 · Hypothesis explaining the negative correlation between reflection rate and accuracy without implying that reflection is harmful.
- Empirical result showing alignment increases with model competence