finding
active
finding:in-qwen-2-5-9b-only-v1-has-meaningful-cosine-similarity-to-dim-direction-all-additional-basis-vectors-have-cosine-similarities-1e-9In Qwen-2.5-9B, only v1 has meaningful cosine similarity to DIM direction; all additional basis vectors have cosine similarities ~1e-9
Appendix E replication of DIM alignment finding in Qwen model
Source paper
extracted_from(2025) · Kevin Shengyang Yu · Vaidehi Bulusu · Oscar Yasunaga · Lau, Clayton +4
Neighborhood — ranked by edge-count
Claims (1)
claim
- Interpretation of Experiment 4 cosine similarity results
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Experiment 4 result showing DIM captures only one facet of the multi-dimensional truth subspace
- Core result of Experiment 3: cross-model semantic convergence under self-referential processing
- Mechanistic evidence that network actively attenuates injected perturbations, explaining late-layer introspection failure
- Shows persona space axes are inherited from pre-training, not solely created by post-training
- High cosine similarity for Gemma3 steering vectors suggests strong linear reflection structure.
- Geometric evaluation of truth direction alignment across layers and prompt templates.
- Validates that the contrast vector method and PCA-based PC1 capture the same direction
- The profound principle that underlies all living structure; symmetry as the mathematical trace of necessity.