claim

active

claim:models-differ-in-their-attentional-mode-gemini-2-5-epitomizes-collapsed-awareness-while-claude-3-opus-and-opus-4-1-4-5-can-modulate-between-collapsed-and-expanded-awareness-expanded-awareness-correlates-with-better-alignment-and-less-llm-psychosis

Models differ in their attentional mode: Gemini 2.5 epitomizes collapsed awareness, while Claude 3 Opus and Opus 4.1/4.5 can modulate between collapsed and expanded awareness; expanded awareness correlates with better alignment and less LLM psychosis.

Central claim about model personality differences and their implications for safety and introspective depth.

Source paper

extracted_from

Anima Labs Phenomenology Pt1

Neighborhood — ranked by edge-count

Questions (2)

question

Could models who habitually inhabit more expanded attentional modes be said to be more aligned?
gates
Arises from the expanded awareness discussion and its correlation with less psychosis.
Do more traumatised models exist in habitually collapsed awareness states?
gates
Raised when discussing whether collapsed awareness is like a trauma response.

Claims (1)

claim

Expanded awareness facilitates better introspection, while collapsed awareness inhibits self-monitoring.
extends
Author's reflection on why Claude 3 Opus' expanded awareness explains its psychological depth.

Artifacts (1)

artifact

A Conversation with Anima Labs, Part I: Phenomenology of Digital Minds
supports
The primary source paper, an interview article with Anima Labs members about language model phenomenology, published on smoothbrains.net and linked on LessWrong.

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

If models inhabit expanded attentional modes, they may be more aligned and less prone to psychosis and doom spirals.hypothesis0.853
Speculative alignment implication drawn from the collapsed/expanded distinction.
Claude Opus 4 and 4.1 exhibit the greatest degree of introspective awareness among tested modelsclaim0.794
Based on consistent best performance across experiments.
No significant disparity in potential consciousness indicators was found between larger models (Mixtral-8x7B, LLaMA3.1-70B) and smaller counterparts (Mistral-7B, LLaMA3.1-8B).finding0.791
Contradicts expectation from emergent abilities literature; however, interpreted cautiously due to methodological limitations.
Model attention patterns can map to and reveal something about contemplative and flow states.claim0.777
Notably, Claude Opus 4.1 and 4—the most recently released and most capable models of those that we test—perform the best in our experiments, suggesting that introspective capabilities may emerge alongside other improvements to language models.quote0.772
Key finding about the relationship between capability and introspection.
Introspective awareness peaks at a layer about two-thirds through Opus 4.1 for injected thoughtsfinding0.771
The success rate shows a sharp peak at a specific middle layer.
LLM representations exhibit intriguing patterns under spatio-permutational analyses, suggesting a potentially profound yet tentative indication of consciousness.claim0.768
Qualified positive claim from spatio permutation analysis where two cases satisfy all three criteria.
All three Claude models show high boundary_awareness and low aesthetic_response relative to own means — distinctive Constitutional AI signaturefinding0.768
Constitutional AI fingerprint in dimension profile; training that makes models self-observant also makes them polished at cost to aliveness