claim
active
claim:there-is-a-many-to-many-mapping-between-neurons-and-concepts-meaning-multiple-high-level-causal-variables-might-be-encoded-in-overlapping-groups-of-neuronsThere is a many-to-many mapping between neurons and concepts, meaning multiple high-level causal variables might be encoded in overlapping groups of neurons
Fundamental theoretical claim motivating DAS, attributed to Smolensky/Rumelhart/McClelland.
Source paper
extracted_from(2023) · Atticus Geiger · Zhengxuan Wu · Christopher Potts · Thomas Icard +1
Neighborhood — ranked by edge-count
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Precise characterization of why polysemanticity poses a combinatorial obstacle to circuit analysis
- Central claim motivating DAS over prior methods.
- Necessary condition for connectionist cognition.
- Superposition hypothesis: neural networks represent more features than dimensions using almost-orthogonal directions.hypothesis0.781Explanation for why dictionary learning can recover many more features than dimensions.
- Encapsulates the core challenge of the field.
- Claim from footnote 3, acknowledging neuron-level interpretability while arguing subcomponents are better.
- Motivated by the finding that lexical entailment decomposes into word identities.