quote
active
quote:smolensky-1986-proposes-that-viewing-a-neural-representation-under-a-basis-that-is-not-aligned-with-individual-neurons-can-reveal-the-interpretable-distributed-structure-of-the-neural-representationsSmolensky (1986) proposes that viewing a neural representation under a basis that is not aligned with individual neurons can reveal the interpretable distributed structure of the neural representations.
Load-bearing theoretical claim providing the conceptual foundation for DAS.
Source paper
extracted_from(2023) · Atticus Geiger · Zhengxuan Wu · Christopher Potts · Thomas Icard +1
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Representations where individual neurons play multiple conceptual roles; patterns consisting of linear combinations of unit vectors.
Methods (1)
method
- Distributed Alignment SearchsupportsThe core method introduced in this paper: finds alignments between high-level causal variables and distributed neural representations via gradient descent.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Linear representation hypothesis: neural networks represent meaningful concepts as directions in their activation spaces.hypothesis0.817Foundation for interpreting features as linear directions.
- Superposition hypothesis: neural networks represent more features than dimensions using almost-orthogonal directions.hypothesis0.816Explanation for why dictionary learning can recover many more features than dimensions.
- Neural Representations of Location Composed of Spatially Periodic Bands (Krupic et al., 2012)concept0.804Discovery of band cells; TEM-t also recapitulates these representations.
- Extends convergence argument to brain-machine alignment
- Claim from footnote 3, acknowledging neuron-level interpretability while arguing subcomponents are better.
- Motivated by the finding that lexical entailment decomposes into word identities.
- Paper explicitly identifies this as a current gap requiring alternative experimental approaches
- Opening sentence framing the paper's core inquiry.