concept
active
concept:high-order-semantic-features
High-Order Semantic Features
Complex, behavior-level semantic attributes, such as personality traits, that the paper aims to steer.
Neighborhood — ranked by edge-count
Papers (1)
Concepts (2)
- Big Five Personality Traits (associated_with): Case study used for high-order semantic feature steering; a psychology taxonomy.
- Controlled Semantic Oppositions (associated_with): Technique using semantically opposite prompts to contrast and identify features.
Related by similarity (8)
cosine ≥ 0.65 · no typed edge
Entities in the same semantic neighborhood but without a typed relation to this one; these are candidates for new edges or unrecognized duplicates.
- Decoder cosine similarity maps onto concept similarity.
- Representations of one's own mental states; associated with consciousness in higher-order theories.
- The meaningful organization of concepts in a model's representation space, claimed to be better captured by manifolds than by SAEs.
- Features respond to concepts across languages and in images, not just text.
- Motivating question throughout: using order theory to capture information flow, approximation, and program behavior.
- Desires through which an agent identifies with some first-order desires and repudiates others.
- The central idea that external structure binds latent patterns to desired targets.
- Mode in the feature density histogram around 1e-5, corresponding to interpretable features, in contrast with the ultralow-density cluster.
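
The "cosine ≥ 0.65 · no typed edge" filter above can be sketched as follows. This is only an illustration under stated assumptions: the page does not say how entity embeddings are produced or stored, so the toy vectors, the entity names other than `high-order-semantic-features`, and the helper `similar_without_edge` are all hypothetical.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def similar_without_edge(target, embeddings, typed_edges, threshold=0.65):
    """Return (name, score) pairs with cosine >= threshold and no typed edge
    to `target`, sorted by descending score. Hypothetical helper."""
    results = []
    for name, vec in embeddings.items():
        # Skip the entity itself and anything already linked by a typed edge.
        if name == target or name in typed_edges:
            continue
        score = cosine(embeddings[target], vec)
        if score >= threshold:
            results.append((name, score))
    return sorted(results, key=lambda pair: pair[1], reverse=True)

# Toy, made-up 3-d embeddings (assumption: real embeddings are much larger).
embeddings = {
    "high-order-semantic-features": [1.0, 0.0, 0.0],
    "linked-concept": [1.0, 0.0, 0.0],    # similar, but already has a typed edge
    "nearby-concept": [0.8, 0.6, 0.0],    # similar, no typed edge
    "distant-concept": [0.0, 1.0, 0.0],   # below the threshold
}
typed_edges = {"linked-concept"}

candidates = similar_without_edge(
    "high-order-semantic-features", embeddings, typed_edges
)
```

In this toy setup only `nearby-concept` is returned: `linked-concept` is excluded because it already has a typed edge, and `distant-concept` falls below the 0.65 threshold.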