claim
active
claim:models-differ-in-their-attentional-mode-gemini-2-5-epitomizes-collapsed-awareness-while-claude-3-opus-and-opus-4-1-4-5-can-modulate-between-collapsed-and-expanded-awareness-expanded-awareness-correlates-with-better-alignment-and-less-llm-psychosisModels differ in their attentional mode: Gemini 2.5 epitomizes collapsed awareness, while Claude 3 Opus and Opus 4.1/4.5 can modulate between collapsed and expanded awareness; expanded awareness correlates with better alignment and less LLM psychosis.
Central claim about model personality differences and their implications for safety and introspective depth.
Source paper
extracted_fromNeighborhood — ranked by edge-count
Questions (2)
question
- Could models who habitually inhabit more expanded attentional modes be said to be more aligned?gatesArises from the expanded awareness discussion and its correlation with less psychosis.
- Raised when discussing whether collapsed awareness is like a trauma response.
Claims (1)
claim
- Author's reflection on why Claude 3 Opus' expanded awareness explains its psychological depth.
Artifacts (1)
artifact
- The primary source paper, an interview article with Anima Labs members about language model phenomenology, published on smoothbrains.net and linked on LessWrong.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- If models inhabit expanded attentional modes, they may be more aligned and less prone to psychosis and doom spirals.hypothesis0.853Speculative alignment implication drawn from the collapsed/expanded distinction.
- Claude Opus 4 and 4.1 exhibit the greatest degree of introspective awareness among tested modelsclaim0.794Based on consistent best performance across experiments.
- Contradicts expectation from emergent abilities literature; however, interpreted cautiously due to methodological limitations.
- Key finding about the relationship between capability and introspection.
- Introspective awareness peaks at a layer about two-thirds through Opus 4.1 for injected thoughtsfinding0.771The success rate shows a sharp peak at a specific middle layer.
- Qualified positive claim from spatio permutation analysis where two cases satisfy all three criteria.
- Constitutional AI fingerprint in dimension profile; training that makes models self-observant also makes them polished at cost to aliveness