finding
active
finding:on-qwen3-1-7b-mds-achieves-1-c-5-0-sjts-vs-p2-at-4-7-and-1-c-1-4-sjts-vs-p2-at-3-6On Qwen3-1.7B, MDS achieves ϕ1,C,↑ = 5.0 (SJTs) vs P2 at 4.7, and ϕ1,C,↓ = 1.4 (SJTs) vs P2 at 3.6
Specific consciousness sweep result for Qwen3-1.7B from Table 6 demonstrating strong bidirectional steering
Source paper
extracted_from(2026) · Leonardo Blas · Robin Jia · Emilio Ferrara
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Per-model steerability comparison from Table 4
- MDS achieves global win proportion of 89.5% on SJTs across 14 LLMs and four injection stridesfinding0.765MDS dominates in open-ended generation by global win proportion metric (Table 2)
- Core finding demonstrating non-monotonic relationship between base capability and harness-benefit
- Smaller models show non-monotonic and diminished ASR with increasing cone dimensionality
- DB-MTL achieves ∆p = +1.15±0.16 on NYUv2, outperforming all baselines including state-of-the-artfinding0.761Primary empirical validation on scene understanding task
- Quantifies harness activation failure for weak-tier models vs. strong-tier models
- Opus 4.6 achieves HFR of 0.757 while Qwen3-32B achieves HFR of only 0.142 on SkillsBenchfinding0.758Quantifies harness adherence failure gap between strong and weak tier models
- Performance with a different backbone network.