concept
active
concept:feature-universalityFeature Universality
Property of features that form consistently across different models trained on the same or similar data, suggesting features are real representational units
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Authors take agnostic position on ontological status but universality evidence pushes toward features being real
- The hypothesis that analogous features and circuits reliably form across different neural network models and tasks
- Property that features activate on only a small fraction of inputs; enables compressed sensing and is what allows superposition to work
- The property that every place generated by a living process is inevitably unique due to its adaptation to specific conditions.
- Is the apparent universality of some low-level vision features the exception or the rule?question0.764Open empirical question following anecdotal cross-model universality findings
- Phenomenon where a feature in a small SAE splits into multiple finer features in a larger SAE.
- The claim that truth directions are consistent and generalizable across layers, tasks, and prompt formats in LLMs.
- A feature that responds to only a single latent variable, contrasted with polysemantic features