concept
active
concept:dormant-subspaceDormant Subspace
Subspace dimensions that do not vary between inputs; included in the extraneous subspace zextra.
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Behavioral Null Spaceassociated_withThe span of vector directions that do not change network behavior; a key concept distinguishing MAS from model stitching.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Extension of DAS that learns a second rotation matrix on top of a fixed first one to decompose representations into sub-representations.
- Perturbations behaviorally null in one context but altering behavior in another due to latent divergence
- A vector subspace that causally impacts outputs only through the sign of its values, enabling harmless magnitude divergence
- The multi-dimensional activation subspace whose directions causally mediate truthful behavior in LLMs
- Intervention targeting specific dimensional subsets of activation vectors rather than full representations
- Subspaces whose contributions to a layer's output are canceled by opposing weight values, making them non-causally active under natural inputs
- The subspace of activation space spanned by the 171 orthogonalized emotion probe vectors, used to measure SAE feature emotional alignment
- Investigation of whether a distributed representation can be further decomposed into sub-representations encoding component identities.