Mid-Layer Emotion Representation Peak

Empirical observation that steering efficiency peaks at middle transformer layers, consistent with emotion representation literature

Neighborhood — ranked by edge-count

Findings (1)

finding

MDS injection steering efficiency peaks at mid-layers across LLMs, injection strides, and OCEAN traits
supports
Consistent empirical pattern supporting the connection between mid-layer representations and emotion/behavioral content

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

Mid-layers (6-15) achieve peak anchoring because semantic structure differentiates while maintaining coherence, forming a Goldilocks zoneclaim0.753
Interpretation of E3 layer-wise results; motivates targeted UCCT interventions at layers 8-12
Peak layer ℓ* median 10, IQR 0.384finding0.745
Median layer where S(ℓ) peaks, across seeds.
Enacted reflection may correspond to silent mid-layer processing; described reflection to the motor impulse of concepts leaking through to output.claim0.733
Mechanistic analog connecting Lindsey's layer-localized findings to the scorer's enacted/described distinction
Thought detection peaks at ~2/3 layer depth; intention checking peaks at ~1/2 layer depth.finding0.731
Lindsey (2026) differential layer performance explained by Janus's path combinatorics — different tasks use different path distributions.
Emotion refers to a state concept, so stateful representations in general may be more persistent across tokens.claim0.729
Interpretive hypothesis offered to explain why emotion features are more persistent
Introspective signals appear in middle layers but are suppressed by later post-training-shaped layers.finding0.720
Mechanistic finding by Lindsey (2026) explaining how contemplative prompt may work: enables mid-layer introspection to reach output.
emotion feature persistenceconcept0.718
The phenomenon that emotion feature activations remain elevated above baseline beyond local token bursts, measurable as long-range correlation
Layer 27 (last layer) has largest projection magnitude on the reflection direction among all attention head layers in DeepSeek-R1-Qwen-1.5Bfinding0.716
Attribution finding suggesting the last layer directly controls reflection keyword generation