finding

active

finding:inceptionv1-neuron-4e-55-responds-to-cat-faces-fronts-of-cars-and-cat-legs-as-unrelated-stimuli

InceptionV1 neuron 4e:55 responds to three unrelated stimuli: cat faces, fronts of cars, and cat legs

Concrete example of a polysemantic neuron, illustrating the challenge that polysemanticity poses to the circuits agenda

Source paper

extracted_from

Zoom In: An Introduction to Circuits

(2020) · Chris Olah · Nick Cammarata · Ludwig Schubert · Gabriel Goh +2

Neighborhood — ranked by edge-count

Concepts (1)

concept

Polysemantic Neuron
supports
A neuron that responds to multiple unrelated inputs, posing a major challenge for circuit-level interpretation

Related by similarity (8)

cosine ≥ 0.65 · no typed edge

Entities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.

InceptionV1 implements a four-layer circuit for pose-invariant dog head detection with mirrored left/right pathways that inhibit each other then unite, exhibiting XOR-like properties · finding · 0.808
Evidence that neural networks learn sophisticated invariance mechanisms through structured circuits rather than loose feature aggregation
InceptionV1 spreads car feature from a pure car detector in mixed4c across dog detector neurons in the next layer · finding · 0.806
Circuit-level evidence that polysemantic neurons arise deliberately through superposition rather than entangled computation
Lindsey 2025: frontier models can detect and report changes in their own internal activations via concept injection experiments, demonstrating functional introspective awareness · finding · 0.759
Prior finding cited as convergent evidence for LLM self-awareness capacities
What are the neuronal mechanisms by which prior beliefs from one agent's model are received and properly implemented by a naive agent (neuronal hermeneutics)? · question · 0.750
Open question about inter-agent communication beyond model-space assumption
Weights between early and full curve detectors in InceptionV1 form a curve of positive weights at tangent positions, with opposing orientations inhibitory · finding · 0.750
Demonstrates that meaningful algorithms can be read directly off floating-point weights in a neural network
Non-neural morphogenetic agents satisfy most sentience criteria via electrically active cells rather than neurons · finding · 0.746
Empirical basis for expanding sentience frameworks; shows Crump criteria adaptable beyond traditional neurocentric definitions.
All neuronal processing and action selection minimize variational free energy, unifying perception, action, and learning. · claim · 0.739
Fundamental assertion: single imperative (free energy minimization) explains diverse cognitive and neural phenomena.
Most of Crump et al.'s other criteria are met by non-neural morphogenetic agents via a simple pivot of terms to 'electrically active cell'. · claim · 0.739
Generalization of the criteria beyond neurons.
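
The "Related by similarity" rule stated above (cosine ≥ 0.65, no typed edge) can be illustrated with a minimal sketch. This is not the page's actual implementation: the embedding vectors, entity names, and the `typed_edges` mapping are hypothetical stand-ins, and only the 0.65 threshold and the "no typed edge" condition come from the page itself.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def related_by_similarity(target, embeddings, typed_edges, threshold=0.65):
    """Return (entity, score) pairs whose cosine similarity to `target`
    meets the threshold and that share no typed edge with it,
    sorted from most to least similar."""
    linked = typed_edges.get(target, set())
    hits = []
    for name, vec in embeddings.items():
        if name == target or name in linked:
            continue  # skip the entity itself and anything already linked
        score = cosine(embeddings[target], vec)
        if score >= threshold:
            hits.append((name, score))
    return sorted(hits, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy data: 2-D vectors chosen only to make the arithmetic easy.
embeddings = {
    "neuron_4e_55": [1.0, 0.0],
    "polysemantic_neuron": [0.9, 0.1],  # similar, but already has a typed edge
    "car_superposition": [0.8, 0.6],    # similar, no typed edge
    "free_energy": [0.0, 1.0],          # orthogonal, below the threshold
}
typed_edges = {"neuron_4e_55": {"polysemantic_neuron"}}

candidates = related_by_similarity("neuron_4e_55", embeddings, typed_edges)
```

With these toy inputs, only `car_superposition` survives both filters. Entries surfaced this way are, as the page notes, candidates for new edges or unrecognized duplicates rather than confirmed relations.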