concept
active
concept:sparse-and-smooth-coding
Sparse and smooth coding
A coding scheme in which each quality is represented by a small number of active neurons, and the representations vary continuously so that similarity between qualities is graded.
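The definition above can be illustrated with a toy population code. This is a minimal sketch, not a model taken from the linked paper: the Gaussian tuning curves, the 50 neurons, the `width` and `threshold` values, and the function names are all assumptions chosen for illustration. It shows that only a small fraction of units respond to any one quality (sparse), while nearby qualities produce overlapping codes and distant ones do not (smooth).

```python
import numpy as np

def population_code(quality, centers, width=0.03, threshold=0.05):
    """Toy code: Gaussian tuning curves, with weak responses zeroed out.

    `centers`, `width`, and `threshold` are illustrative assumptions.
    """
    responses = np.exp(-0.5 * ((quality - centers) / width) ** 2)
    responses[responses < threshold] = 0.0  # enforce sparsity
    return responses

def cosine(a, b):
    """Cosine similarity between two response vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Hypothetical preferred values of 50 neurons along a 1-D quality axis.
centers = np.linspace(0.0, 1.0, 50)

code_a = population_code(0.30, centers)
code_b = population_code(0.32, centers)  # a nearby quality
code_c = population_code(0.80, centers)  # a distant quality

# Sparse: only a small fraction of neurons are active for one quality.
active_fraction = np.count_nonzero(code_a) / code_a.size

# Smooth: nearby qualities have similar codes, distant ones do not.
sim_near = cosine(code_a, code_b)
sim_far = cosine(code_a, code_c)
```

Comparing `sim_near` with `sim_far` gives a simple similarity/discriminability measure over the toy quality axis; a real quality space would be multidimensional and learned rather than hand-specified.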
Neighborhood — ranked by edge-count
Papers (1)
paper
Concepts (1)
concept
- HOT-4: Sparse and smooth coding generating a 'quality space' · associated_with · Indicator: representation format providing qualitative character via similarity/discriminability.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one. These are candidates for new edges or unrecognized duplicates.
- Interpretability framework used to decompose layer-40 activations into sparse feature sets for studying emotional alignment and persistence
- Method from Gurnee et al. 2023 for finding feature directions including individual neuron analysis
- The process by which wholeness is continuously extended through structure-preserving steps without breaking the existing structure.
- The extracted set of sparse interpretable features from model embeddings via SAEs
- Primary method introduced: trains a one-hidden-layer MLP with L1 sparsity penalty to decompose model activations into overcomplete feature dictionaries
- Coherent, predictable changes in model behavior achieved by navigating along the learned manifold rather than using straight-line interventions.
- The characteristic that successive states in any natural developmental sequence are so alike as to be hardly distinguishable, even when overall change is enormous
- Used in Anthropic welfare assessment to identify performative behavior and hidden emotional struggle co-activating features
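The selection rule stated for this list (cosine ≥ 0.65, no typed edge) could be sketched as follows. This is an illustrative guess at the filtering logic only: the function name, the embedding vectors, and the entity names in the toy data are hypothetical, and the page does not say how the embeddings are computed.

```python
import numpy as np

def similarity_candidates(target, embeddings, typed_edges, min_cosine=0.65):
    """Return (entity, cosine) pairs at or above min_cosine that have no
    typed edge to `target`, most similar first.

    `embeddings` maps entity name -> vector; `typed_edges` is the set of
    entities already linked to `target` by a typed relation. Both are
    assumed inputs for this sketch.
    """
    t = embeddings[target]
    out = []
    for name, vec in embeddings.items():
        if name == target or name in typed_edges:
            continue  # skip the entity itself and anything already linked
        cos = float(t @ vec / (np.linalg.norm(t) * np.linalg.norm(vec)))
        if cos >= min_cosine:
            out.append((name, cos))
    return sorted(out, key=lambda pair: pair[1], reverse=True)

# Made-up toy embeddings, for illustration only.
embeddings = {
    "sparse-and-smooth-coding": np.array([1.0, 0.0, 0.0]),
    "sparse-features": np.array([0.9, 0.1, 0.0]),
    "HOT-4": np.array([0.95, 0.05, 0.0]),
    "unrelated": np.array([0.0, 1.0, 0.0]),
}
typed_edges = {"HOT-4"}  # already connected via associated_with

candidates = similarity_candidates(
    "sparse-and-smooth-coding", embeddings, typed_edges
)
```

In the toy data, `HOT-4` is excluded because it already has a typed edge, and `unrelated` is excluded because it falls below the threshold.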