concept
active
concept:feature-co-occurrenceFeature co-occurrence
Patterns of which features activate together across tokens; preserved by covariance pooling but lost in mean pooling.
Neighborhood — ranked by edge-count
Papers (1)
paper
- Covariance-based Sequence Poolingassociated_with
Concepts (1)
concept
- Second momentsassociated_withStatistical moments capturing pairwise feature co-occurrence patterns; the core insight is that second moments preserve structure that first moments destroy.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Property that features activate on only a small fraction of inputs; enables compressed sensing and is what allows superposition to work
- Interpretive claim that the statistically derived clusters reflect conceptual similarity or interdependence among the properties.
- Mechanistic finding by Bricken et al. 2023 about how LLMs store features; cited as operational justification for pattern-repository assumption
- A hypothesized intermediate-level linearly-represented feature (e.g., Beijing and China are closely associated) that may correlate with truth in unnegated datasets but anti-correlate in negated ones
- Metaphor treating each system feature or function as a separate application that can be independently loaded and managed.
- Property of features that form consistently across different models trained on the same or similar data, suggesting features are real representational units
- Method of optimizing input to cause a neuron to fire maximally, used to characterize what a neuron detects; establishes causal link
- Decoder cosine similarity maps onto concept similarity.