finding
active
finding:the-28-mlp-neurons-at-layer-18-can-be-partitioned-into-disjoint-clusters-each-computing-the-sum-for-a-fourier-feature-with-a-different-periodThe 28 MLP neurons at layer 18 can be partitioned into disjoint clusters each computing the sum for a Fourier feature with a different period
Structural finding showing modular organization within the sparse neuron set
Source paper
extracted_from(2026) · Sheridan Feucht · Tal Haklay · Usha Bhalla · Daniel Wurgaft +8
Neighborhood — ranked by edge-count
Claims (1)
claim
- Claim about the sparsity and sufficiency of the identified neuron set
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- A sparse set of 28 MLP neurons at layer 18 (~0.2% of MLP) are reused across all cyclic tasksfinding0.870Quantitative finding identifying the specific neurons responsible for generic addition
- Fourier features with period 10 contribute to base-10 sum computation in the 28-neuron clusterfinding0.781One of the three base-10 Fourier periods identified in the sparse neuron set
- The 28 identified neurons can be partitioned into disjoint clusters each computing a different Fourier period sum
- Visual geometric evidence for the fundamental entanglement of true/false activations in harder tasks.
- Feed-forward neural network with hidden layers, capable of representing non-linearly separable functions.
- Key limitation of the paper's approach; MLP layers make up 2/3 of standard transformer parameters
- 512-neuron MLP continues to yield new features as autoencoder scales to 131,072 features (256× expansion)finding0.764Shows superposition enables many more features than neurons
- Major open problem identified in the paper; MLP layers constitute 2/3 of transformer parameters