finding
active
finding:learned-rotation-matrices-are-non-trivial-majority-of-basis-vectors-are-rotated-indicating-highly-distributed-representationsLearned rotation matrices are non-trivial: majority of basis vectors are rotated, indicating highly distributed representations
Learned rotations reveal that direct probes over standard activation bases would miss the actual causal role of representations.
Source paper
extracted_from(2023) · Atticus Geiger · Zhengxuan Wu · Christopher Potts · Thomas Icard +1
Neighborhood — ranked by edge-count
Claims (1)
claim
- Supported by the finding that non-trivial rotations are required to find aligned representations.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- One of the updates about prosaic ML simulation.
- Supported by the instruction discovery experiments comparing steering vs. embedding baselines.
- Selective pressure toward convergence via task generality
- Reframing observation: the canonical K/Q/V decomposition is computationally convenient but not the most interpretable representation
- Core applied contribution claim, supported by top-k accuracy comparisons.
- Core theoretical claim about the target of representation learning
- Forward-looking interpretive claim about the implications of recurrent position encodings for NLP research.
- Author’s interpretive claim that the shared geometry is general and robust.