finding
active
finding:impulsivity-probe-peak-cohen-s-d-3-60-layer-13-p-3-58-10-13-in-llama-3-2-3b
Impulsivity probe: peak Cohen's d=3.60 (layer 13), p=3.58×10⁻¹³ in LLaMA-3.2-3B
Strongest probe validation result; highest Cohen's d among the four concepts
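For readers unfamiliar with the effect-size metric named above, here is a minimal sketch of Cohen's d with a pooled standard deviation. The entry does not say which variant of d the source paper uses, so the pooled-SD form is an assumption, and the function name and the toy scores are hypothetical illustrations rather than data from the paper.

```python
import math
from statistics import mean, variance


def cohens_d(group_a, group_b):
    """Cohen's d between two samples, using the pooled sample standard deviation.

    Assumption: the pooled-SD variant; the source entry does not specify one.
    """
    n_a, n_b = len(group_a), len(group_b)
    # statistics.variance is the sample variance (n - 1 denominator).
    pooled_var = (
        (n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)
    ) / (n_a + n_b - 2)
    return (mean(group_a) - mean(group_b)) / math.sqrt(pooled_var)


# Hypothetical probe projections for the two sides of a contrastive pair.
impulsive_scores = [2.0, 3.0, 4.0]
planning_scores = [0.0, 1.0, 2.0]

d = cohens_d(impulsive_scores, planning_scores)
print(f"Cohen's d = {d:.2f}")  # prints "Cohen's d = 2.00"
```

In a layer sweep, the same function would be applied to the projections at each layer, and the layer with the largest d reported as the peak.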
Source paper
extracted_from (2026) · Nicolas Martorell · Bruno Bianchi
Neighborhood — ranked by edge-count
Concepts (1)
concept
- One of four emotive concept probes trained; contrastive pair impulsive/planning with best layer 13 in LLaMA-3.2-3B
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; candidates for new edges or unrecognized duplicates.
- Probe validation result confirming interest direction captures meaningful structure
- Probe validation result confirming wellbeing direction captures meaningful structure
- LLaMA-3.2-1B impulsivity introspection: ρ=0.21, p<10⁻⁴ (significant but weaker than 3B ρ=0.52) · finding · 0.848
  Impulsivity shows significant introspection in 1B but declines in 8B; non-monotonic scaling
- Impulsivity→interest: ρ increases from 0.70 (α=-4) to 0.83 (α=+4); R² from 0.46 to 0.69 in LLaMA-3.2-3B · finding · 0.818
  Scatter plot visualization showing strengthened probe-report relationship across alpha range
- Weaker cross-family probe; explains weaker introspection in Gemma
- Third-strongest pooled introspective coupling in primary model
- Second significant cross-concept introspection improvement; marginal after BH correction (q≈0.066)
- Evidence of a bottleneck between richer internal variation and final report distribution in impulsivity→interest condition
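One neighbor above reports a result that is marginal after BH correction (q≈0.066). As a reminder of what that adjustment does, here is a minimal sketch of the Benjamini-Hochberg step-up procedure. The function name and the p-values are hypothetical; the source paper's actual family of tests is not given in this entry.

```python
def bh_adjust(p_values):
    """Return Benjamini-Hochberg adjusted p-values (q-values), in input order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    q_values = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity of q.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * m / rank)
        q_values[i] = running_min
    return q_values


# Hypothetical raw p-values for four tests (not from the paper).
p_values = [0.01, 0.04, 0.03, 0.20]
q_values = bh_adjust(p_values)
print([round(q, 4) for q in q_values])
```

A raw p-value below 0.05 can end up with q above 0.05 once the number of tests is taken into account, which is what "marginal after BH correction" describes.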