finding
active
finding:qwen-2-5-7b-wellbeing-probe-peak-cohen-s-d-3-5Qwen 2.5 7B wellbeing probe: peak Cohen's d=3.5
Strongest cross-family probe; explains clearer introspection in Qwen than Gemma
Source paper
extracted_from(2026) · Nicolas Martorell · Bianchi, Bruno
Neighborhood — ranked by edge-count
Findings (1)
finding
- Strong introspective coupling in Qwen model; demonstrates cross-family generalization of introspective capacity
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Weaker cross-family probe; explains weaker introspection in Gemma
- Probe validation result confirming wellbeing direction captures meaningful structure
- Probe validation result confirming interest direction captures meaningful structure
- Strongest probe validation result; highest Cohen's d among the four concepts
- Wellbeing probe drift is positive in Gemma (ρ=0.34 pooled turn-correlation) and Qwen (ρ=0.24); both p<10⁻⁵finding0.776Normalized probe-score drift across turns generalizes beyond LLaMA family
- Internal-state drift generalizes across scales; normalized drift also increases significantly with log(model size)
- Qwen 35B (3B active params, score 4.38) outscores Hermes 405B (405B active params, score 1.75) by 2.5xfinding0.765Parameters don't predict scores; 135x more parameters yields 60% lower score
- Smaller models show non-monotonic and diminished ASR with increasing cone dimensionality