concept
active
concept:mid-layer-emotion-representation-peakMid-Layer Emotion Representation Peak
Empirical observation that steering efficiency peaks at middle transformer layers, consistent with emotion representation literature
Neighborhood — ranked by edge-count
Findings (1)
finding
- Consistent empirical pattern supporting the connection between mid-layer representations and emotion/behavioral content
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Interpretation of E3 layer-wise results; motivates targeted UCCT interventions at layers 8-12
- Median layer where S(ℓ) peaks, across seeds.
- Mechanistic analog connecting Lindsey's layer-localized findings to the scorer's enacted/described distinction
- Thought detection peaks at ~2/3 layer depth; intention checking peaks at ~1/2 layer depth.finding0.731Lindsey (2026) differential layer performance explained by Janus's path combinatorics — different tasks use different path distributions.
- Interpretive hypothesis offered to explain why emotion features are more persistent
- Introspective signals appear in middle layers but are suppressed by later post-training-shaped layers.finding0.720Mechanistic finding by Lindsey (2026) explaining how contemplative prompt may work: enables mid-layer introspection to reach output.
- The phenomenon that emotion feature activations remain elevated above baseline beyond local token bursts, measurable as long-range correlation
- Attribution finding suggesting the last layer directly controls reflection keyword generation