finding
active
finding:interest-probe-peak-cohen-s-d-1-67-layer-14-p-9-45-10-6-in-llama-3-2-3b
Interest probe: peak Cohen's d=1.67 (layer 14), p=9.45×10⁻⁶ in LLaMA-3.2-3B
Probe validation result confirming interest direction captures meaningful structure
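As a rough illustration of what a "peak Cohen's d" summary involves, the sketch below computes Cohen's d between probe scores for the two sides of a contrastive pair at each layer and reports the layer with the largest value. The source does not say which variant of Cohen's d was used or how the p-value was obtained, so the pooled-standard-deviation form, the function names `cohens_d` and `peak_layer`, and the toy scores are all assumptions made for illustration, not the paper's procedure or data.

```python
import math
from statistics import mean, variance


def cohens_d(pos, neg):
    """Cohen's d using the pooled sample standard deviation (assumed variant)."""
    n1, n2 = len(pos), len(neg)
    pooled_var = ((n1 - 1) * variance(pos) + (n2 - 1) * variance(neg)) / (n1 + n2 - 2)
    return (mean(pos) - mean(neg)) / math.sqrt(pooled_var)


def peak_layer(scores_by_layer):
    """Return (layer, d) for the layer with the largest Cohen's d.

    scores_by_layer maps a layer index to a pair (pos_scores, neg_scores),
    e.g. probe scores on 'interested' vs. 'bored' prompts.
    """
    ds = {layer: cohens_d(pos, neg) for layer, (pos, neg) in scores_by_layer.items()}
    best = max(ds, key=ds.get)
    return best, ds[best]


# Toy, hand-made scores (hypothetical; not taken from the paper).
toy_scores = {
    10: ([0.2, 0.4, 0.6], [0.1, 0.3, 0.5]),
    14: ([2.0, 3.0, 4.0], [0.0, 1.0, 2.0]),
}
layer, d = peak_layer(toy_scores)
print(layer, d)
```

With real data, the per-layer inputs would be probe projections of held-out activations; a significance test such as a two-sample t-test could be run on the same two groups, though the source does not state which test produced the reported p-value.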
Source paper
extracted_from (2026) · Nicolas Martorell · Bruno Bianchi
Neighborhood — ranked by edge-count
Concepts (2)
concept
- One of four emotive concept probes trained; contrastive pair distracted/focused with best layer 10 in LLaMA-3.2-3B
- One of four emotive concept probes trained; contrastive pair bored/interested with best layer 14 in LLaMA-3.2-3B
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Strongest probe validation result; highest Cohen's d among the four concepts
- Probe validation result confirming wellbeing direction captures meaningful structure
- Weaker cross-family probe; explains weaker introspection in Gemma
- Strongest cross-family probe; explains clearer introspection in Qwen than Gemma
- LLaMA E3 geometry summary: S_max = −1.896 ± 0.211, AUS_N = −2.119 ± 0.198, peak layer ℓ* = 10 [IQR 0.384] · finding · 0.798 · Seed-pooled geometry statistics for LLaMA in E3, providing quantitative basis for geometry-to-behavior correlate
- LLaMA-3.1-8B: Sbmax = -1.896 ± 0.211, AUSN = -2.119 ± 0.198, peak layer ℓ* = 10 (median) · finding · 0.797 · Seed-pooled geometry-only statistics (per-dev z units).
- Interest probe score drifts positively across turns: LMM slope=0.005, p=4.12×10⁻¹⁴ in LLaMA-3.2-3B · finding · 0.789 · Demonstrates genuine internal-state dynamics in LLMs during multi-turn conversation
- Likely-trained MM probe is a surprisingly effective causal baseline due to correlation between truth and probability on sp_en_trans