concept
active
concept:feature-interferenceFeature Interference
When non-orthogonal features cause logistic regression to identify a suboptimal probe direction
Neighborhood — ranked by edge-count
Papers (1)
paper
Concepts (1)
concept
- Maximum Margin Separatorassociated_withThe direction logistic regression converges to on linearly separable data; shown to be suboptimal for identifying truth direction
Related by similarity (8)
cosine ≥ 0.65 · no typed edgeEntities in the same semantic neighborhood but without a typed relation to this one — candidates for new edges or unrecognized duplicates.
- Logit weight contributions from a feature that arise due to superposition with other features, not from the feature's own causal role
- Asymmetric transfer after fine-tuning: high-density bases (B10) are more robust.
- Property that features activate on only a small fraction of inputs; enables compressed sensing and is what allows superposition to work
- Phenomenon where a feature in a small SAE splits into multiple finer features in a larger SAE.
- Used to knock down ion channel or gap junction genes to perturb bioelectric circuits.
- A feature that responds to only a single latent variable, contrasted with polysemantic features
- Method of optimizing input to cause a neuron to fire maximally, used to characterize what a neuron detects; establishes causal link
- Domain of techniques for constructing informative features from raw data; covariance pooling is a feature engineering method for token sequences.