question
active
question:lack-of-rigorous-cross-model-comparison-demonstrating-that-specific-named-features-not-just-correlated-ones-form-across-architecturesLack of rigorous cross-model comparison demonstrating that specific named features (not just correlated ones) form across architectures
Explicitly identified research gap: anecdotal evidence exists but rigorous characterization is absent
Source paper
extracted_from(2020) · Chris Olah · Nick Cammarata · Ludwig Schubert · Gabriel Goh +2
Neighborhood — ranked by edge-count
Papers (1)
paper
- Zoom In: An Introduction to Circuitsassociated_with
Claims (1)
claim
- Third of three speculative claims asserting that learned features are not model-specific but represent universal solutions to learning problems
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Bigger models are more likely to converge to a shared representation than smaller modelshypothesis0.781Selective pressure toward convergence via model capacity
- How do representations differ or converge between architectures, tasks, and modalities?question0.780Broader research question MAS is positioned to address, citing multiple recent works.
- Features may not be strictly one-dimensional objects; higher-dimensional feature manifolds may exist in model representationshypothesis0.777Extension of superposition hypothesis to account for continuous families of features
- Authors take agnostic position on ontological status but universality evidence pushes toward features being real
- Argues for intersubjective agreement about the quality of life.
- Assertion that the popular models add nothing to parallel programming.