claim
active
claim:dim-captures-only-one-facet-of-the-multi-dimensional-truth-subspace-additional-orthogonal-structure-exists-beyond-itDIM captures only one facet of the multi-dimensional truth subspace; additional orthogonal structure exists beyond it
Interpretation of Experiment 4 cosine similarity results
Source paper
extracted_from(2025) · Kevin Shengyang Yu · Vaidehi Bulusu · Oscar Yasunaga · Lau, Clayton +4
Neighborhood — ranked by edge-count
Papers (1)
paper
Findings (2)
finding
- Experiment 4 result showing DIM captures only one facet of the multi-dimensional truth subspace
- Appendix E replication of DIM alignment finding in Qwen model
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Load-bearing interpretive claim about the layer-specificity of Burger et al.'s finding.
- Interpretive synthesis of DIM and cone intervention successes
- Theoretical open question about the geometry of truth in LLMs raised in Discussion
- Mechanistic explanation for why superposition is geometrically feasible
- How can we discover a maximally informative or interpretable truth subspace rather than just a sufficient one?question0.761Limitation-driven open question about subspace optimality
- Superposition hypothesis: neural networks represent more features than dimensions using almost-orthogonal directions.hypothesis0.747Explanation for why dictionary learning can recover many more features than dimensions.
- Central interpretive claim of the paper
- Reinterpretation of Burger et al.'s finding as layer-specific rather than universal.