concept
active
concept:emotion-geometry-in-llm-activationsEmotion geometry in LLM activations
Emotion emerges early, peaks in middle layers, sharpens with scale, and persists across tokens in LLM activations per Zhang & Zhong 2025
Neighborhood — ranked by edge-count
Papers (1)
paper
Thinkers (1)
thinker
- Jingxiang ZhangintroducesShowed emotion is geometrically structured in LLM activations; foundational characterization of emotion geometry
Concepts (1)
concept
- Emotive states in LLMsassociated_withDirections in activation space associated with contrastive emotive concept pairs studied in this paper as targets for introspection
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Internal representations encoding emotion concepts in large language models, identified by probing and SAE methods
- Question raised by Anthropic and partially addressed by this paper's persistence evidence
- Mechanism by which activation of an emotion feature sometimes leads to later suppression of that same featurequestion0.748Identified research gap: the paper observes anti-persistence but has no explanation for it
- Key empirical result showing that optimizing for behavioral outputs and fitting representation geometry produce the same path in activation space.
- Rich geometric structure carried by neural representations.
- The finding that interpretable concepts including character traits are encoded as linear directions in transformer residual streams