claim
active
claim:approximately-0-2-of-mlp-neurons-at-layer-18-28-neurons-are-sufficient-to-account-for-the-generic-addition-computation-across-all-cyclic-tasksApproximately 0.2% of MLP neurons at layer 18 (~28 neurons) are sufficient to account for the generic addition computation across all cyclic tasks
Claim about the sparsity and sufficiency of the identified neuron set
Source paper
extracted_from(2026) · Sheridan Feucht · Tal Haklay · Usha Bhalla · Daniel Wurgaft +8
Neighborhood — ranked by edge-count
Findings (2)
finding
- A sparse set of 28 MLP neurons at layer 18 (~0.2% of MLP) are reused across all cyclic taskssupportsQuantitative finding identifying the specific neurons responsible for generic addition
- Structural finding showing modular organization within the sparse neuron set
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- 512-neuron MLP continues to yield new features as autoencoder scales to 131,072 features (256× expansion)finding0.785Shows superposition enables many more features than neurons
- Addition of neural tissue to standard brains will likely result in increased processing capacity due to adaptive design.hypothesis0.769Prediction about the plasticity of neural systems.
- SAE features are not simply mirroring individual neurons.
- Key mechanistic finding showing task-agnostic reuse of arithmetic circuitry
- Connects this study's results to Schrimpf et al. 2021 and Caucheteux et al. 2022/2023 findings on brain-LLM alignment.
- Key limitation of the paper's approach; MLP layers make up 2/3 of standard transformer parameters