finding
active
finding:inceptionv1-spreads-car-feature-from-a-pure-car-detector-in-mixed4c-across-dog-detector-neurons-in-the-next-layerInceptionV1 spreads car feature from a pure car detector in mixed4c across dog detector neurons in the next layer
Circuit-level evidence that polysemantic neurons arise deliberately through superposition rather than entangled computation
Source paper
extracted_from(2020) · Chris Olah · Nick Cammarata · Ludwig Schubert · Gabriel Goh +2
Neighborhood — ranked by edge-count
Claims (1)
claim
- Interpretation of the cars-in-superposition circuit finding as an intentional representational strategy
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Evidence that neural networks learn sophisticated invariance mechanisms through structured circuits rather than loose feature aggregation
- InceptionV1 neuron 4e:55 responds to cat faces, fronts of cars, and cat legs as unrelated stimulifinding0.806Concrete example of polysemantic neuron demonstrating the challenge to the circuits agenda
- Demonstrates that meaningful algorithms can be read directly off floating-point weights in a neural network
- Curve detectors found across AlexNet, InceptionV1, VGG19, ResNetV2-50 and models trained on Places365finding0.766Anecdotal evidence for the universality of low-level visual features across different architectures and datasets
- Second low-level feature type demonstrating cross-architecture universality
- Prior finding cited as convergent evidence for LLM self-awareness capacities
- Quantitative relationship between concept frequency and feature presence.
- Highlights the non-genetic control of large-scale anatomy.