finding
active
InceptionV1 implements a four-layer circuit for pose-invariant dog head detection with mirrored left/right pathways that inhibit each other then unite, exhibiting XOR-like properties
Evidence that neural networks learn sophisticated invariance mechanisms through structured circuits rather than loose feature aggregation
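As a rough illustration of what "inhibit each other then unite" can mean, the toy sketch below models two mirrored pathway units that each subtract the opposite orientation's evidence before their outputs are summed. It is not the InceptionV1 circuit itself: the function names, the scalar inputs, and the `inhibition` weight are all hypothetical and chosen only to show why such a layout responds to either orientation alone but not to both at once.

```python
def relu(x: float) -> float:
    """Rectified linear unit, the usual nonlinearity in such circuits."""
    return max(0.0, x)


def pose_invariant_response(left: float, right: float, inhibition: float = 1.0) -> float:
    """Toy model: mirrored pathways inhibit each other, then unite.

    `left` and `right` stand for evidence of a left-facing or right-facing
    head. All values and weights here are illustrative assumptions.
    """
    # Each pathway is excited by its own orientation and inhibited by the other.
    left_path = relu(left - inhibition * right)
    right_path = relu(right - inhibition * left)
    # The union step: either orientation is enough to drive the output.
    return left_path + right_path


if __name__ == "__main__":
    for left, right in [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]:
        print(left, right, pose_invariant_response(left, right))
```

With binary inputs, the output is nonzero for exactly one active orientation and zero when both or neither are present, which is the XOR-like behavior the finding refers to.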
Source paper
extracted_from · (2020) · Chris Olah · Nick Cammarata · Ludwig Schubert · Gabriel Goh +2
Neighborhood — ranked by edge-count
Claims (1)
claim
- Second of three speculative claims asserting that subgraphs of neural networks are tractable and meaningful objects of study
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one: candidates for new edges or unrecognized duplicates.
- Circuit-level evidence that polysemantic neurons arise deliberately through superposition rather than entangled computation
- InceptionV1 neuron 4e:55 responds to cat faces, fronts of cars, and cat legs as unrelated stimuli · finding · 0.808 · Concrete example of polysemantic neuron demonstrating the challenge to the circuits agenda
- Demonstrates that meaningful algorithms can be read directly off floating-point weights in a neural network
- A high-level feature neuron in InceptionV1 that detects dog heads regardless of orientation, illustrating higher-level understandable features
- Striking mechanistic finding that injection creates universally detectable perturbation in residual stream immediately downstream
- Quantitative verification of the mechanistic theory; both circuits required for the induction algorithm show the predicted copying/matching structure
- Single dendritic layer solves XOR-like problems with capacity matching 8-layer deep networks. · finding · 0.746 · Evidence from Beniaguev et al. (2021) that individual biological neurons vastly outperform McCulloch-Pitts model; supports hybrid computation claim.
- Synthetic theoretical example showing pernicious divergence via hidden pathway activation