concept
active
concept:change-of-basis-for-neural-representationsChange-of-Basis for Neural Representations
Key insight that rotating a neural representation to a non-standard basis can reveal distributed causal structure invisible in standard neuron-aligned basis.
Neighborhood — ranked by edge-count
Methods (1)
method
- Distributed Alignment SearchimplementsThe core method introduced in this paper: finds alignments between high-level causal variables and distributed neural representations via gradient descent.
Concepts (1)
concept
- Distributed Neural Representationsassociated_withRepresentations where individual neurons play multiple conceptual roles; patterns consisting of linear combinations of unit vectors.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Load-bearing theoretical claim providing the conceptual foundation for DAS.
- Neural Representations of Location Composed of Spatially Periodic Bands (Krupic et al., 2012)concept0.773Discovery of band cells; TEM-t also recapitulates these representations.
- Neural representation geometry causally shapes behavior; interventions respecting that geometry will yield natural trajectories.hypothesis0.771Central hypothesis tested via manifold steering experiments across language models and video world models.
- The broader conceptual framework that neural activations exhibit non-Euclidean geometric structure causally linked to behavior.
- The paper's core causal assertion: geometry is not incidental but mechanistically linked to behavior
- The model's parameters considered as the actual 'code' implementing its algorithms, as opposed to human-written code.
- Do divergent representations change what an intervention can say about an NN's natural mechanisms?question0.761Core research question motivating the paper
- Brain-based physical implementations of consciousness-related functions, assumed by many ToCs to be exclusive.