finding
active
finding:a-sparse-set-of-28-mlp-neurons-at-layer-18-0-2-of-mlp-are-reused-across-all-cyclic-tasksA sparse set of 28 MLP neurons at layer 18 (~0.2% of MLP) are reused across all cyclic tasks
Quantitative finding identifying the specific neurons responsible for generic addition
Source paper
extracted_from(2026) · Sheridan Feucht · Tal Haklay · Usha Bhalla · Daniel Wurgaft +8
Neighborhood — ranked by edge-count
Claims (1)
claim
- Claim about the sparsity and sufficiency of the identified neuron set
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Structural finding showing modular organization within the sparse neuron set
- Key limitation of the paper's approach; MLP layers make up 2/3 of standard transformer parameters
- 512-neuron MLP continues to yield new features as autoencoder scales to 131,072 features (256× expansion)finding0.769Shows superposition enables many more features than neurons
- Hypothesis based on observed negative cosine similarity between input and output weights of some neurons
- Feed-forward neural network with hidden layers, capable of representing non-linearly separable functions.
- The sparse set of 28 neurons at layer 18 identified as responsible for Fourier feature computation across all cyclic tasks
- SAE features are not simply mirroring individual neurons.