claim
active
claim:there-appears-to-be-a-systematic-relationship-between-the-frequency-of-concepts-and-the-dictionary-size-needed-to-resolve-features-for-themThere appears to be a systematic relationship between the frequency of concepts and the dictionary size needed to resolve features for them.
Feature presence depends on concept frequency in training data, with a threshold scaling inversely with alive features.
Source paper
extracted_fromNeighborhood — ranked by edge-count
Findings (1)
finding
- Quantitative relationship between concept frequency and feature presence.
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Decoder cosine similarity maps onto concept similarity.
- A combinatorial argument that good sequences are astronomically rare, emphasizing the difficulty of discovery.
- Motivation for using sparsity-based dictionary learning on language models
- what is the 'correct number of features' for dictionary learning, and is this question well-posed?question0.784Open question about whether there is a true discrete feature count or a continuous splitting process
- The profound principle that underlies all living structure; symmetry as the mathematical trace of necessity.
- what metrics can reliably tell us if dictionary learning has successfully extracted high quality features?question0.780Central methodological gap: current metrics (loss, density histograms, manual inspection) are inadequate
- Author’s interpretive claim that the shared geometry is general and robust.