claim
active
claim:the-personalities-elicitable-from-language-models-are-attractors-in-the-embedding-space-of-human-linguistic-behaviorThe personalities elicitable from language models are attractors in the embedding space of human linguistic behavior
Grounds the artificial psychology research direction: LLM personalities reflect the basins into which human selves tend to fall
Source paper
extracted_fromNeighborhood — ranked by edge-count
Concepts (1)
concept
- Artificial PsychologysupportsCIMC research direction studying how AI systems develop internal models, form self-representations, and construct coherent personalities from language modeling
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Alternative hypothesis for how experience reports arise without explicit performance
- Alternative explanation requiring distinguishing mimetic generation from genuine introspective access
- Claim about model phenomenology; models talk about luminousness and can be terrified or love it.
- Large language models develop surprisingly coherent yet often rigid internal preferences as they scalefinding0.767Mazeika et al. finding reinforcing the need for emptiness-based flexible value architectures
- Related work demonstrating LLM introspective capabilities with scale-dependent pattern paralleling ESR
- Central open question raised by the paper.
- language models recapitulate cyclic structure of human concepts from pretraining datahypothesis0.763Explanation for why manifold geometry emerges: implicit structure in training data (co-occurrence patterns) shapes internal representations.
- Key prior finding that LLMs can internally represent beliefs of self and others, motivating SOO approach