finding
active
finding:in-a-4-features-functionally-memorize-merchantability-or-fitness-for-a-particular-purpose-via-fsa-like-feature-chainIn A/4, features functionally memorize 'MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE' via FSA-like feature chain
Demonstrates mechanistic memorization via feature assemblies in superposition
Source paper
extracted_from(2024) · Marc Carauleanu · Michael Vaiana · Judd Rosenblatt · Cameron Berg +1
Neighborhood — ranked by edge-count
Concepts (1)
concept
- Memorization in SuperpositionsupportsSpecific phrases or sequences memorized via binary features in superposition, enabling narrow pattern matching despite few neurons
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Quantitative relationship between concept frequency and feature presence.
- Vision of the emerging paradigm shift in society.
- Four features (A/0/20, A/0/0, A/0/30, A/0/494) form an FSA-like system implementing HTML tag generationfinding0.746Concrete example of features connecting into FSA-like system implementing complex behavior
- Concrete example of feature splitting revealing unexpected model structure
- Definition of the newly named empirical effect.
- General principle supported tangentially by covariance pooling work; relates to feature co-occurrence structure.
- Main functional claim about MCA.
- Authors argue features are model properties because logit effects and ablations are consistent with feature interpretations