question
active
question:does-the-model-have-a-feature-corresponding-to-every-major-world-citydoes the model have a feature corresponding to every major world city?
Question explored in feature completeness study.
Source paper
extracted_fromNeighborhood — ranked by edge-count
Findings (1)
finding
- Quantitative relationship between concept frequency and feature presence.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Theme issue context: relates to internal models of environment, central to consciousness and cognition across substrates.
- Core claim distinguishing this paper's contribution from looser representational similarity arguments.
- Clamping feature activations causally alters model behavior in interpretable ways.
- Authors take agnostic position on ontological status but universality evidence pushes toward features being real
- Explicitly identified research gap: anecdotal evidence exists but rigorous characterization is absent
- Author's interpretation of the VTAB alignment results echoing Tolstoy
- Third of three speculative claims asserting that learned features are not model-specific but represent universal solutions to learning problems
- Observation about asymmetry in base model capabilities.