framework
active
framework:neural-geometry-frameworkNeural Geometry Framework
Conceptual scheme introduced in this paper: neural networks develop internal geometric representations that mirror real-world geometry, providing the right level of description for interpretability and control.
Neighborhood — ranked by edge-count
Papers (1)
paper
- The World Inside Neural Networksintroduces
Concepts (2)
concept
- manifoldcitesA smooth, potentially curved surface in activation space along which activations vary according to a coherent semantic dimension.
- Proposed interpretability primitives that respect the geometric structure of representations, as opposed to atomized SAE features.
Communities (1)
community
- Neural Geometrycites
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- The manifold structure of model outputs, modelled by M_y.
- The broader conceptual framework that neural activations exhibit non-Euclidean geometric structure causally linked to behavior.
- The overarching theoretical framework proposed in the paper, asserting that steering interventions should be aligned with the geometric structure of the model's representation manifold.
- The model's parameters considered as the actual 'code' implementing its algorithms, as opposed to human-written code.
- The actual shapes and spatial relationships of buildings, essential to living structure.
- Michael Johnson's prior work on how neural networks (and brains) can be 'annealed' to find optimal states.