claim
active
claim:analogous-features-and-circuits-form-across-models-and-tasks
Analogous features and circuits form across models and tasks.
Third of three speculative claims asserting that learned features are not model-specific but represent universal solutions to learning problems
Source paper
extracted_from · (2020) · Chris Olah · Nick Cammarata · Ludwig Schubert · Gabriel Goh +2
Neighborhood — ranked by edge-count
Papers (1)
paper
- Zoom In: An Introduction to Circuits · introduces
Findings (2)
finding
- Anecdotal evidence for the universality of low-level visual features across different architectures and datasets
- Second low-level feature type demonstrating cross-architecture universality
Hypotheses (1)
hypothesis
- Specific cross-domain prediction mentioned by neuroscientists in conversation with the authors
Concepts (1)
concept
- Universality Hypothesis · associated_with · The hypothesis that analogous features and circuits reliably form across different neural network models and tasks
Claims (2)
claim
- Second of three speculative claims asserting that subgraphs of neural networks are tractable and meaningful objects of study
- Speculative extension of universality to neuroscience, with high-low frequency detectors as a candidate prediction
Questions (2)
question
- Explicitly identified research gap: anecdotal evidence exists but rigorous characterization is absent
- Open empirical question following anecdotal cross-model universality findings
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Normative vision for how the circuits agenda could resolve the pre-paradigmatic state of interpretability
- Refinement of character-circuit overlap, stressing that self-character is not just another fiction character.
- Cited as enabling precise behavioral control through SAE features, extending the same methodological line
- How do representations differ or converge between architectures, tasks, and modalities? · question · 0.771 · Broader research question MAS is positioned to address, citing multiple recent works.
- Author's interpretation of the VTAB alignment results echoing Tolstoy
- Decoder cosine similarity maps onto concept similarity.
- Assertion that the popular models add nothing to parallel programming.
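The "related by similarity" filter described above (cosine ≥ 0.65, no typed edge) can be sketched roughly as follows. This is a minimal illustration, not the system's actual implementation: the function names, the embeddings dictionary, and the edge-pair representation are all hypothetical. Only the 0.65 threshold and the "no typed edge" condition come from the page itself.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def related_by_similarity(entity_id, embeddings, typed_edges, threshold=0.65):
    """Return (other_id, score) pairs at or above the threshold that share no
    typed edge with entity_id, sorted by descending score.

    embeddings:  hypothetical dict mapping entity id -> list of floats
    typed_edges: hypothetical iterable of (id_a, id_b) pairs, treated as undirected
    """
    # Collect every entity already connected to entity_id by a typed edge.
    linked = set()
    for a, b in typed_edges:
        if a == entity_id:
            linked.add(b)
        elif b == entity_id:
            linked.add(a)

    target = embeddings[entity_id]
    hits = []
    for other_id, vec in embeddings.items():
        if other_id == entity_id or other_id in linked:
            continue  # skip self and anything that already has a typed relation
        score = cosine(target, vec)
        if score >= threshold:
            hits.append((other_id, score))
    return sorted(hits, key=lambda pair: pair[1], reverse=True)
```

Under these assumptions, an entity that scores above the threshold but already has a typed relation (such as the associated concept) is excluded, which is why this list surfaces only candidates for new edges or possible duplicates.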