concept
active
concept:rosetta-neuronsRosetta Neurons
Neurons activated by the same pattern across a range of vision models, forming a common dictionary independently discovered by all models
Neighborhood — ranked by edge-count
Papers (1)
paper
Thinkers (1)
thinker
- Dravid et al.introducesFound Rosetta Neurons activated by the same patterns across a range of vision models
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Cited evidence that convergence extends to the neuron level, not just representational geometry
- The sparse set of 28 neurons at layer 18 identified as responsible for Fourier feature computation across all cyclic tasks
- Biologically plausible two-pool architecture from Krotov & Hopfield (2020) splitting self-attention into feature and memory neuron populations; used to interpret TEM-t place cells.
- MLP neurons and attention heads hypothesized to delete information from the residual stream by writing the negative of what they read
- Autoencoder neurons that fail to activate across any datapoints during training; addressed via neuron resampling
- Hebrew feature is effectively invisible in the neuron basis
- Empirical basis for treating curve detectors as a canonical example of meaningful, understandable features
- Prior framework combining cellular automata with deep learning, extended by this work